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PINl PEPTIDYL PROLYL ISOMERASE POLYPEPTIDES, THEIR 
CRYSTAL STRUCTURES, AND USE THEREOF FOR DRUG DESIGN 

5 FIELD AND INDUSTRIAL APPLICABILITY OF THE INVENTION 

[0001] This application claims priority under 35 USC § 119 to U.S. Provisional 
Application No. 60/394,889. 

[0002] The present invention relates to mutant PINl polypeptides that lack a PINl 
WW domain and the polynucleotides that encode them. The invention also relates to 
10 the X-ray crystal structures of theses polypeptides. Additionally, the invention relates 
to crystallized complexes of the mutant PINl PPIase polypeptides and small entities 
that bind to the PINl PPIase substrate-binding domain. The invention also relates to 
the use of the atomic coordinates determined from such crystal structures for the use 
in drug design and development. 

15 BACKGROUND OF THE INVENTION 

[0003] The cell cycle represents a series of ordered processes that ultimately results in 
the duplication of a cell. Somatic cell division consists of two sequential processes, 
mainly DNA replication followed by chromosomal separation. The cell spends most 
of its time preparing for these events in a growth cycle (interphase), which in turn 

20 consists of three subphases: initial gap (Gt), synthesis (S), and secondary gap (G2). In 
Gi, the cell undergoes a high rate of biosynthesis. The S phase begins when DNA 
synthesis starts and ends when the DNA content of the nucleus has doubled. The cell 
then enters G2, which lasts until the cell enters the final phase of division, mitosis (M). 
The M phase begins with nuclear envelope breakdown, chromosome condensation 

25 and formation of two identical sets of chromosomes that are separated into two new 
nuclei. This is followed by cell division (cytokineis), which results in two daughter 
cells. This separation terminates the M phase and marks the beginning of interphase 
for the new cells. 

[0004] Entry into mitosis is a highly regulated event in normal cells. In eukaryotic 
30 cells studied to date, Cdc2/cyclin B, a Ser/Thr kinase, regulates entry into mitosis 
(Nurse, Nature 344:503-508 (1990)). To prevent inappropriate mitotic activity, the 
activity of Cdc2/cyclin B is tightly regulated. The Cdc/cyclin complex is both 
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positively and negatively regulated by phosphorylation. Cdc2/cyclin B, when 
activated by dephosphorylation by Cdc25, drives cells into mitosis. 

[0005] One regulator of Cdc25 is PINl, a peptidyl-prolyl isomerase (PPIase). PINl is 
a member of the parvulin family of PPIases and catalyzes rotation about the peptide 
5 bond preceding a proline residue. This reaction is suggested to be important in the 
folding and trafficking of some proteins (Schmid, Curr. Biol 5:993-994 (1995)). 
Other well-characterized PPIase families include the cyclophilins, and the FK506- 
binding proteins (FKBPs), which are targets of the immunosuppresive drugs 
cyclosporin A and FK506, respectively. Parvulins, such as PINl, the cyclophilins, 
10 and the FKBPs are unrelated in primary sequence. 

[0006] PINl has been identified in all eukaryotic organisms where examined, 
including plants, yeast, insects and mammals (Hanes et al.. Yeast 5:55-72 (1989); Lu 
et al., Nature 380:544-547 (1996); Maleszka et al., Proc, Natl Acad, Scl U.S.A. 
93:447-451 (1996)). The yeast (Essl) and Drosophila {dodo) PINl orthologues have 
15 high identity to human-expressed sequence tags, which ultimately led to the cloning 
of the human dodo gene called PINl (Maleszka et al., Gene 203:89-93 (1997)). The 
fly dodo gene is reported to be 45% identical to the yeast gene, Essl. 

^ [0007] Using a yeast two-hybrid screen of a human cDNA library, human PINl was 
originally identified as a binding protein of the fiingi Aspergillus nidulens protein 

20 NIMA, (Lu et al., 1996, supra). NIMA is a kinase that drives cells into mitosis and is 
reported to be negatively regulated by PINl . Depletion of NIMA in A. nidulans cells 
is reported to lead to cell cycle arrest in G2, while overexpression is reported to 
promote premature mitosis. Ser/Thr kinase Cdc2/cyclin B may be the analogous 
NIMA kinase in human cells, although another NIMA-like pathway in human cells is 

25 postulated to exist (Lu et al.. Cell 81:413-424 (1995)). 

[0008] Modulation of PINl activity is reported to result in dramatic morphological 
cellular phenotypes. For example, overexpression of PINl in Hela cells was reported 
to cause a G2 arrest while depletion caused mitotic arrest, the opposite phenotypes 
observed with NIMA modulation (Lu et al., 1996, supra\ Crenshaw et al., EMBO J. 
30 17:1315-1327 (1998)), Additionally, decreasing PINl protein expression by full- 
length antisense expression has been reported to cause cells to progress into mitosis 
prematurely, to contain aberrant nuclei due to premature chromosome condensation 
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and to induce apoptosis (Lu et al., 1996, supra). These data indicate that PlNl is a 
negative regulator of mitosis through interactions with a mammalian functional 
homologue of NIMA and is required for progression through mitosis. Further, 
depletion of PINl is also postulated to play a role Alzheimer's disease (Lu et al., 
5 Nature 399:784-788 (1999)). 

[0009] In vitro, PINl has been reported to interact with mitotic proteins also 
recognized by the MPM-2 antibody (Crenshaw et al., supra\ Lu et al., Science 
283:1325-1328 (1999); Ranganathan et al.. Cell 89:875-886 (1997); and Yaffe et al., 
Science 278:1957-1960 (1997)). The MPM-2 monoclonal antibody recognizes a 

10 phospho-SerA'hr-Pro epitope on about approximately 50 proteins associated with 
mitosis, including important mitotic regulators, such as Cdc25, Weel, Cdc27, Map 4, 
and NIMA (Davis et al., Proc. Natl Acad. Set. U.S.A. 80:2926-2930 (1983); Kuang et 
al., Proc. Natl Acad ScL UXA. 86:4982-4986 (1989); Westendorf et al., Proc. Natl 
Acad ScL UXA, 91:714-718 (1994); and Stuckenberg et al., CurrBiol 7:338-348 

15 (1997)). PINl has also been reported to interact with important upstream regulators 
of Cdc2/cyclin B including Cdc25 and its known regulator, Plxl (Shen et al., Genes 
Dev, 12:706 (1998); Crenshaw etal.,£MBO J. 17:1315-1327(1998)). PINl, due to 
its enzymatic action may remove Cdc25 and Plxl from play by causing their 
degradation within the cell. 

20 [0010] Studies indicate that the biological function of PINl depends on a functional 
PPIase active site (Lu et al., 1999, supra). Studies also indicate that PINl recognizes 
its substrates (mitosis-specific phosphoproteins) through its WW domain. The WW 
domain is a protein recognition motif that is prevalent throughout biology. However, 
the PINl WW domain is unique in that it requires its ligand protein to contain a 

25 phosphorylated serine. As with the PPIase domain, a fiinctional WW domain is 
reported to be essential for biological function of PINl . This is consistent with the 
model where PINl recognizes its substrates through the WW domain followed by 
completion of its essential catalytic role. 

[001 1] Full-length PINl protein and the nucleotide sequence encoding full-length 
30 PINl are disclosed in U.S. Patent Nos. 5,952,467 and 5,972,697. Sequence 

information for PINl amino-acid sequence and mRNA sequence have been deposited 
in GenBank under accession numbers NM006221 (mRNA) and S68520 (protein). 
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The mRNA sequence for dodo is deposited in GenBank under accession number 
U35 140. Mouse PINl mRNA sequence is deposited in GenBank under accession 
number NM_02337L 

[0012] Ranganathan, et al, {Cell, 89: 875-886 (1997); International Publication No. 

5 WO 99/63931; and U.S. Patent Application Publication No. US2001/0016346 Al) 
present the crystal structure of full-length PINl reportedly complexed with an AlaPro 
dipeptide. The atomic coordinates for the crystal structure reported by Ranganathan 
et al. are available in the Protein Data Bank (PDB). Information from the PDB 
internet site (http://www.rcsb.org/pdp/) indicates that this data was deposited on June 

10 21, 1998, and released on October 14, 1998. 

[0013] Neoplastic cells, due to their inherent genetic instability, have lost many of the 
control mechanisms regulating cell division. Such neoplastic cells are more 
susceptible to cell-cycle modulation or intervention as a means of inducing cell death 
by apoptosis. Further, because alterations in cell-cycle control are one of the 
15 differences between normal cells and cancer cells, proteins involved in cell-cycle 
control are attractive targets for developing cytotoxic agents effective for use in cell 
proliferative disorders. One such target is PINl . 

[0014] PINl inhibitors will be cytotoxic to cells and affect cells in the G2 phase of the 
cell cycle. Transformed cells will be hypersensitive to a PINl inhibitor due to their 
20 genomic instability and decreased and ineflBcient regulation of the cell cycle. 

[0015] Inhibitors of PINl have been described in the literature. For example, Hennig 
et al., {Biochemistry 37: 5953-5960 (1998)) report that juglone (5 -hydroxy- 1,4- 
naphthoquinone) selectively inhibits several parvulins, including human PINl. Noel 
et al. (International Publication No. WO 99/63931 and U.S. Patent Application 
25 Publication No. US201/0016346 Al), using data based on the crystal structure 

derived from fiill-length human PINl, describe certain compounds as being inhibitors 
of PINl . Lu et al. (International Publication No. PCT WO 99/12962) report 
inhibitors that mimic the phospho-Ser/Thr moiety of the phosphoserine or 
phosphothreonine-proline peptidyl prolyl isomerase substrate. 

30 [0016] Because of the important role that PINl plays in the regulation of the cell 
cycle, stable recombinant polypeptides containing the PINl PPIase binding domain 
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that are capable of manipulation for biochemical assays and crystallography studies 
are needed for the development of compounds that are modulators of PIN 1 PPIase 
activity. 

SUMMARY OF THE INVENTION 

5 [001 7] The present invention relates to polynucleotides and the polypeptides they 
encode. These polynucleotides encode for the PINl PPIase domain but do not encode 
for the PINl WW domain. The genetically engineered polypeptides encoded by the 
polynucleotides described herein may also contain discreet amino acid substitutions as 
compared to the wild-type PINl PPIase domain. The polypeptides described herein 
10 are advantageous over full-length wild-type PINl because they have better 

crystallization properties when crystallized with ligands that interact with the PPIase 
substrate-binding domain. 

[0018] One embodiment of the invention includes polynucleotides that encode for a 
PINl peptidyl-prolyl isomerase (PPIase) polypeptide that is devoid of the WW 
15 domain. 

[0019] A preferred embodiment is an isolated polynucleotide that encodes a 
polypeptide including the amino acid sequence of SEQ ID N0:2 and which does not 
have sequences that encode for a WW domain. 

[0020] Preferred is an isolated polynucleotide including the polynucleotide sequence 
20 of SEQ ID NO: 1 where the polynucleotide does not have sequences that encode for a 
WW domain. 

[0021] Another preferred polynucleotide is an isolated polynucleotide that encodes a 
polypeptide including the amino acid sequence of SEQ ID N0:4 and which does not 
have sequences that encode for a WW domain. 

25 [0022] Yet another preferred polynucleotide is an isolated polynucleotide including 
the polynucleotide sequence of SEQ ID N0:3 where the polynucleotide does not have 
sequences that encode for a WW domain. 

[0023] In a preferred embodiment, the polynucleotides described herein encode for at 
least one proteolytic cleavage site. A preferred cleavage site is a thrombin cleavage 
30 site. 
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[0024] In yet another preferred embodiment, the polynucleotides described herein 
include at least one sequence that encodes a histidine tag. 

[0025] The invention also relates to the isolated polypeptides encoded by the 
polynucleotides described herein. These polypeptides contain a PINl PPIase domain 
5 but not a WW domain. Preferred polypeptides include the isolated polypeptides 
having the amino acid sequences of SEQ ID N0:2 or SEQ ID N0:4. 

[0026] Another embodiment of the invention is a vector that includes at least one of 
the isolated polynucleotides described herein. A preferred vector includes a 
polynucleotide that encodes for a PINl PPIase but does not have sequences that 
10 encode for a WW domain. 

[0027] In a preferred embodiment, the vector is an expression vector that includes one 
of the polynucleotides described herein operably linked to a promoter. A preferred 
polynucleotide for expression is one that encodes for a PIN 1 PPIase but does not have 
sequences that encode for a WW domain. 

15 [0028] The invention also relates to a eukaryotic cell line or prokaryotic cell 
transformed or transfected with a vector that includes one of the polynucleotides 
described herein. Preferably the eukaryotic cell line or prokaryotic cell is transformed 
or transfected with a vector that includes a polynucleotide that encodes for a PINl 
PPIase but does not have sequences that encode for a WW domain. 

20 [0029] Another embodiment of the invention is a method of producing a PINl PPIase 
polypeptide where the method includes the following steps: (a) culturing a eukaryotic 
cell line or prokaryotic cell that has been transformed or transfected with a 
polynucleotide that encodes for a PINl PPIase and which does not have sequences 
that encode for a WW domain under conditions such that the polypeptide is 

25 expressed; and (b) recovering the polypeptide. 

[0030] The invention also relates to a method of assaying a compound for its PINl 
modulating ability. The method includes the following steps: adding a test compound 
to a polypeptide comprising a PINl peptidyl-prolyl isomerase wherein the 
polypeptide does not contain a WW domain; measuring the polypeptide's peptidyl- 
30 prolyl isomerase activity; and determining if the activity of the polypeptide is 
modulated by the test compound. 
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[003 1] A preferred method for assaying a compound for its PINl modulating ability 
is a high-throughput assay that includes the following steps: in a multiple vessel 
format, such as microwell plate, test compounds are added to a polypeptide 
comprising a PINl peptidyl-prolyl isomerase wherein the polypeptide does not 
5 contain a WW domain; measuring the polypeptide's peptidyl-prolyl isomerase 
activity; and determining if the activity of the polypeptide is modulated by the test 
compounds screened. 

[0032] Still another embodiment of the invention is a crystal structure of a PESfl 
PPIase polypeptide that is devoid of the WW domain. Preferred are crystal structures 
10 of the polypeptides having the amino acid sequence of SEQ ID NO:2, SEQ ID.N0:4, 
or fragments thereof. 

[0033] In a preferred embodiment the crystal structures diffract X-rays at a resolution 
value greater than or equal to 3 A. In a more preferred embodiment, the crystal 
structures diffract X-rays at a resolution value of greater than or equal to 2 A. 

15 [0034] In another preferred embodiment, the crystal structure of the PINl PPIase 
crystal structure has a three-dimensional structure characterized by the structure 
coordinates of Table II. 

[0035] Another embodiment of the invention is a crystal structure of a PINl PPIase 
polypeptide: ligand complex, wherein the polypeptide does not contain a WW domain. 
20 Preferably the polypeptide in the complex includes the amino acid sequence of SEQ 
IDNO:2orSEQIDNO:4. 

[0036] In a preferred embodiment, the crystal of the PINl PPIase polypeptide:ligand 
complex diffracts X-rays at a resolution of greater than or equal to 3.0 A. In a more 
preferred embodiment, the crystal structure diffracts X-rays at a resolution of greater 
25 than or equal to 2 A. 

[0037] In another preferred embodiment, the ligand in the PINl PPIase 

polypeptide: ligand complex is a modulator of PINl peptidyl-prolyl isomerase activity. 

A preferred modulator has the following formula: 
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[0038] Another embodiment of the invention is a PINl PPIase polypeptide:ligand 
complex crystal structure having a three-dimensional structure characterized by the 
structure coordinates of Table IIL 

[0039] The invention also relates to a method of using the three-dimensional structure 
5 of the PINl PPIase polypeptidexompound I complex as defined by the structure 
coordinates of Table III or a portion thereof in a drug discovery strategy including the 
following steps: 

(a) selecting a potential drug by using computer-aided drug design with the three- 
dimensional structure determined from one or more sets of atomic coordinates 

10 in Table III, wherein the selecting is performed in conjunction with computer 

modeling; 

(b) contacting, the potential drug with a polypeptide containing a functional PINl 
peptidyl-prolyl isomerase; and 

(c) detecting the binding of the potential drug with the polypeptide, wherein a 

15 potential drug is selected for further analysis if the potential drug binds to the 

polypeptide. 

[0040] Another preferred method described herein uses the three-dimehsional 

■ " ", . " ■ — 

structure of the PINl PPIase polypeptide:compound I complex as defined by the 
structure coordinates of Table III, or a portion thereof, in a drug discovery strategy 
20 that includes the following steps: 

(a) selecting a potential drug by using computer-aided drug design with the 
three-dimensional structure determined from one or more sets of structure 
coordinates in Table III, wherein the selecting is performed in conjunction 
with computer modeling; 

25 (b) contacting the potential drug with a polypeptide containing a functional 

PINl peptidyl-prolyl isomerase; and 

(c) determining if the potential drug modulates the peptidyl-prolyl isomerase 
activity of a polypeptide containing a PINl peptidyl-prolyl isomerase. 
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[0041] Also described is a method for evaluating the potential of a chemical entity to 
associate with a molecule or molecular complex including a binding pocket defined 
by structure coordinates of PINl PPIase amino acids His59, Leu61, Lys63, Sere?, 
Arg68, Arg69, Cysll3, Leul22, Metl30, GlnlBl, Phel34, Glul35, Thrl52, Serl54, 
5 and His 15 7, according to Table III, including the steps of: - 

(a) employing computational means to perform a fitting operation between the 
chemical entity and a binding pocket defined by structure coordinates of PINl 
PPIase amino acids His59, Leu61, Lys63, Ser67, Arg68, Arg69, Cysl 13, 
Leul22, Metl30, Glnl31, Phel34, Glul35, Thrl52, Serl54, and Hisl57, 

10 according to Table III; and 

(b) analyzing the results of the fitting operation to quantify the association 
between the chemical entity and the binding pocket. 

[0042] A method is described for evaluating the potential of a chemical entity to 
associate with a molecule or molecular complex . including a binding pocket defined 

15 by structure coordinates of PINl PPIase amino acids Arg54, Arg56, His59, Leu61, 
Lys63, Ser67, Arg68, Arg69, Ser72, Trp73, Serl 1 1, Aspl 12, Cysl 13, Serl 14, 
Serl 15, Alal 16, Lysl 17, Alal 18, Argl 19, Glyl20, Aspl21, Leul22, Glyl23, 
Alal24, Phel25, Serl26, Argl27, Glyl28, Glnl29, Metl30, Glnl31, Lysl32, Prol33, 
Phel34, Glul35, Thrl52, Aspl53, Serl54, and Hisl57 according to Table III, 

20 including the Steps of: ^ 'I |' . ^ ' ^ 

(a) employing computational means to perform a fitting operation between the 
chemical entity and a binding pocket defined by the structure coordinates of 
PINl PPIase amino acids Arg54, Arg56, His59, Leu61, Lys63, Ser67, Arg68, 
Arg69, Ser72, Trp73, Serl 11, Aspl 12, Cysl 13, Serl 14, Serl 15, Alal 16, 

25 Lysl 17, Alal 18, Argl 19, Glyl20, Aspl21, Leul22, Glyl23, Alal24, Phel25, 

Serl26, Argl27, Glyl28, Glnl29, Metl30, Glnl31, Lysl32, Prol33, Phel34, 
Glul35, Thrl52, Aspl 53, Serl 54, and Hisl57 according to Table III; and 

(b) analyzing the results of the fitting operation to quantify the association 
between the chemical entity and the binding pocket. 

30 [0043] Also described herein is a method for identifying a modulator of a molecule 
including a PINl PPIase substrate-binding domain including the steps of: 
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(a) using the structure coordinates of PIN I PPIase amino acids His59, Leu61, 
Lys63, Ser67, Arg68, Arg69, Cysl 13, Leul22, MetlSO, GlnlBl, Phel34, 
Glul35, Thrl52, Serl54, and Hisl57, according to Table III to generate a 
three-diniensional structure of molecule including a PINl PPIase or PPIase- 
like substrate-binding pocket; 

(b) employing the three-dimensional structure to design or select the 
modulator; 

(c) synthesizing or obtaining the modulator; and 

(d) contacting the modulator with the molecule to determine the ability of the 
modulator to interact with the molecule. 

[0044] Another method described for identifying a modulator of a molecule including 
a PINl PPIase substrate-binding domain includes the steps of: 

(a) using the structure coordinates of PIN PPIase amino acids Arg54, Arg56, 
His59, Leu61, Lys63, Ser67, Arg68, Arg69, Ser72, Trp73, Serl 1 1, Aspl 12, 
Cysl 13, Serl 14, Serl 15, Alall6, Lysll7, AlallS, Argll9, Glyl20, Aspl21, 
Leul22, Glyl23, Alal24, Phel25, Serl26, Argl27, Glyl28, Glnl29, Metl30, 
Glnl31, Lysl32, Prol33, Phel34, Glul35, Thrl52, Aspl53, Serl54, and 
Hisl57 according to Table III to generate a three-dimensional structure of the 
molecule including a PINl PPIase or PPIase-like substrate-binding pocket; 

(b) employing the three-dimensional structure to design or select the 
modulator; 

(c) synthesizing or obtaining the modulator; and 

(d) contacting the modulator with the molecule to determine the ability of the 
modulator to interact with the molecule. 

[0045] Yet another method for identifying a modulator of a molecule including a 
PINl PPIase substrate-binding domain includes the steps of: 

(a) using the structure coordinates of all the amino acids of PINl PPIase 
according to Table III to generate a three-dimensional structure of the 
molecule including a PINl PPIase or PPIase-like substrate-binding pocket; 
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(b) employing the three-dimensional structure to design or select the 
modulator; 

(c) synthesizing or obtaining the modulator; and 

(d) contacting the modulator with the molecule to determine the ability of the 
5 modulator to interact with the molecule. 

[0046] A preferred embodiment of the invention is a machine-readable medium 
having stored thereon data including the structure coordinates of a PINl PPIase 
substrate-binding site amino acids His59, Leu61, Lys63, Ser67, Arg68, Arg69, 
Cysl 13, Leul22, Metl30, Glnl31, Phel34, Glul35, Thrl52, Serl54, and Hisl57 
10 according to Table III. 

[0047] Another preferred embodiment is a machine-readable medium having stored 
thereon data including the structure coordinates of a PINl PPIase substrate-binding 
site amino acids Arg54, Arg56, His59, Leu61, Lys63, Ser67, Arg68, Arg69, Ser72, 
Trp73, Serin, Aspll2, Cysl 13, Serll4, SerllS, Alall6, Lysll7, AlallS, Argl 19, 
15 Glyl20, Aspl21, Leul22, Glyl23, Alal24, Phel25, Serl26, Argl27, Glyl28, 
Glnl29, Metl30, Glnl31, Lysl32, Prol33, Phel34, Glul35, Thrl52, Aspl53, 
Serl54, and His 157 according to Table III. 

[0048] Yet another preferred embodiment is a machine-readable medium having 
stored thereon data including all the structure coordinates of a PINl 
20 PPIase:Compound I complex according to Table HI. 

[0049] The invention also describes a method of obtaining structural information 
about a molecule or a molecular complex of unknown structure by using the structure 
coordinates set forth in Table III, including the steps of: 

(a) generating X-ray diffraction data from the crystallized molecule or 
25 molecular complex; and 

(b) applying at least a portion of the structure coordinates set forth in 
Table HI to the X-ray diffraction pattern to generate a three-dimensional 
electron density map of at least a portion of the molecule or molecular 
complex. 
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[0050] Another embodiment of the invention is a method for evaluating the ability of 
a compound to associate with a molecule or molecular complex comprising a PINl 
PPIase substrate-binding pocket. The method includes the steps of: 

(a) constructing a computer model of the binding pocket defined by the 
structure coordinates of PINl PPIase amino acids His59, Leu61, Lys63, Ser67, 
Arg68, Arg69, CyslB, Leul22, Kletl30, Glnl31, Phel34, Glul35, Thrl52, 
Ser 1 54, and Hisl 57 according to Table III; 

(b) selecting a compound to be evaluated by a method selected from the group 
consisting of (i) assembling molecular fragments into a compound, (ii) 
selecting a compound from a small molecule database, (iii) de novo ligand 
design of a compound, and (iv) modifying a known modulator, or a portion 
thereof, of a peptidyl-prolyl isomerase; 

(c) employing computational means to perform a fitting program operation 
between computer models of the compound to be evaluated and the binding 
pocket in order to provide an energy-minimized configuration of the 
compound in the binding pocket; and 

(d) evaluating the results of the fitting operation to quantify the association 
between the the compound and the binding pocket model, thereby evaluating 
the ability of the compound to associate with the binding pocket. 

[005 1] Yet another embodiment of the invention is a method for evaluating the ability 
of a compound to associate with a molecule or molecular complex comprising a PINl 
PPIase substrate-binding pocket. The method includes the steps of: 

(a) constructing a computer model of the binding pocket defined by structure 
coordinates of PINl PPIase amino acids Arg54, Arg56, His59, Leu61, Lys63, 
Ser67, Arg68, Arg69, Ser72, Trp73, Serl 1 1, Aspl 12, Cysl 13, Serl 14, 
Serll5, Alall6, Lysll7, AlallS, Argll9, Glyl20, Aspl21, Leul22, Glyl23, 
Alal24, Phel25, Serl26, Argl27, Glyl28, Glnl29, Metl30, Glnl31, Lysl32, 
Prol33, Phel34, Glul35, Thrl52, Aspl53, Serl54, and Hisl57 according to 
Table III; 

(b) selecting a compound to be evaluated by a method selected from the group 
consisting of (i) assembling molecular fragments into a compound, (ii) 
selecting a compound from a small molecule database, (iii) de novo ligand 
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design of a compound, and (iv) modifying a known modulator, or a portion 
thereof, of a peptidyl-prolyl isomerase; 

(c) employing computational means to perform a fitting program operation 
between computer models of the compound to be evaluated and the binding 
pocket in order to provide an energy-minimized configuration of the 
compound in the binding pocket; and 

(d) evaluating the results of the fitting operation to quantify the association 
between the compound and the binding pocket model, thereby evaluating the 
ability of the compound to associate with the binding pocket. 

[0052] Also disclosed is a method for identifying a modulator of a molecule 
comprising a PINl PPIase substrate-binding site, including the steps of 

(a) constructing a computer model of the the binding pocket defined by 
structure coordinates of PINl PPIase substrate-binding site amino acids His59, 
Leu61, Lys63, Ser67, Arg68, Arg69, Cysl 13, Leul22, MetlBO, GlnlBl, 
Phel34, Glul35, Thrl52, Serl54, and Hisl57 according to Table IH; 

(b) selecting a compound to be evaluated as a modulator by a method selected 
from the group consisting of (i) assembling molecular fi-agments into a 
compound, (ii) selecting a compound from a small molecule database, (iii) de 
novo ligand design of a compound, and (iv) modifying a known inhibitor, or a 
portion thereof, of a peptidyl-prolyl isomerase; 

(c) employing computational means to perform a fitting program operation 
between computer models of the compound to be evaluated and the binding 
pocket in order to provide an energy-minimized configuration of the 
compound in the binding pocket; 

(d) evaluating the results of the fitting operation to quantify the association 
between the compound and the binding pocket model, thereby evaluating the 
ability of the compound to associate with the binding pocket; 

(e) synthesizing the compound; and 

(f) contacting the compound with the molecule to determine the ability of the 
compound to modulate the PPIase activity of the molecule. 

[0053] A preferred embodiment is a method for identifying a modulator of a molecule 
comprising a PINl PPIase substrate-binding site, including the steps of 
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(a) constructing a computer model of the binding pocket defined by structure 
coordinates of PINl PPIase amino acids Arg54, Arg56, His59, Leu61, Lys63, 
Ser67, Arg68, Arg69, Ser72, Trp73, Serl 1 1, Aspl 12, Cysl 13, Serl 14, 
SerllS, Alall6, Lysl 17, Alall8, Argl 19, Glyl20, Aspl21, Leul22, Glyl23, 
Alal24, Phel25, Serl26, Argl27, Glyl28, Glnl29, Metl30, Glnl31, Lysl32, 
Prol33, Phel34, Glul35, Thrl52, Aspl53, Serl54, and Hisl57 according to 
Table HI; 

(b) selecting a compound to be evaluated as a potential activator or inhibitor 
by a method selected from the group consisting of (i) assembling molecular 
fragments into a compound, (ii) selecting a compound from a small molecule 
database, (iii) de novo ligand design of a compound, and (iv) modifying a 
known inhibitor, or a portion thereof, of a peptidyl-prolyl isomerase; 

(c) employing computational means to perform a fitting program operation 
between computer models of the compound to be evaluated and the binding 
pocket in order to provide an energy-minimized configuration of the 
compound in the binding pocket; 

(d) evaluating the results of the fitting operation to quantify the association 
between the compound and the binding pocket model, thereby evaluating the 
ability of the compound to associate with the binding pocket; 

(e) synthesizing the compound; and 

(f) contacting the compound with the molecule to determine the ability of the 
compound to modulate the PPIase activity of the molecule. 

[0054] Another method described herein for screening compounds for PINl PPIase 
modulating activity includes the steps of: 

(a) providing an assay buffer containing a Pintide-PINl PPIase polypeptide 
complex; 

(b) adding a test compound; and 

(c) measuring the disruption of the Pintide-PINl PPIase complex. 

[0055] A preferred embodiment for screening compounds for PINl PPIase 
modulating 

activity is a high-throughput screening method that includes the steps of: 
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(a) providing an assay buffer containing a Pintide-PINl PPIase polypeptide 
complex in a multiple-vessel format, such as a microwell plate; 

(b) adding test compounds; and 

(c) measuring the disruption of the Pintide-PINl PPIase complex in the 
multiple vessels. 

[0056] Another preferred embodiment for screening compounds for PINl PPIase 
modulating activity is a high-throughput screening method that includes the steps of: 

(a) providing an assay buffer containing a fluorscent-Pintide-PINl PPIase 
polypeptide complex in a multi-vessel format; j 

(b) adding test compounds; and 

(c) measuring the disruption of the fluorscent-Pintide-PINl PPIase complex in 
the multiple vessels. 

[0057] Yet another preferred embodiment for screening compounds for PINl PPIase 
modulating activity is a high-throughput screening method that includes the steps of: 

(a) providing an assay buffer containing a fluorscent-Pintide-PINl PPIase 
polypeptide complex in a multi-vessel format; 

(b) adding test compounds; and 

(c) measuring the disruption of the fluorscent-Pintide-PINl PPIase complex in 
the multiple vessels using fluorescence-polarization. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0058] This patent application file contains at least one drawling executed in color. 
Copies of this patent application publication with color drawing(s) will be provided 
by the U.S. Patent and Trademark Office upon request and payment of the necessary 
fee. 

[0059] Fig. 1 is a ribbon-and-stick drawing of the PPIase (K77Q/K82Q) domain 
structure with bound Compound I. Alpha helices are in red, beta strands in yellow, 
turns in blue, and connecting segments in green. The right-hand panel shows the 
structure of full-length PINl. 
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[0060] Fig. 2 A shows a close-up view of the PPIase (K77Q/K82Q) active site with 
Compound I depicted using stick bonds. Amino acid side chains in close proximity to 
Compound I are represented using stick bonds and colored green. 

[0061] Fig. 2B shows a close-up view of the PPIase (K77Q/K82Q) active site and the 
5 electron density for compound L 

[0062] Fig. 3 is a representation of the PPIase (K77Q/K82Q) solvent-accessible 
surface. Red represents hydrophobic regions and cyan represents hydrophilic regions. 

[0063] Fig. 4A lists the nucleotide sequence that encodes human PINl PPIase 
domain. 

10 [0064] Fig. 4B amino acid sequence of human PINl PPIase domain expressed from 
pET-28a after cleavage with thrombin. 

[0065] Fig. 5 A lists the nucleotide sequence that encodes mutant PPIase K77Q/K82Q. 

[0066] Fig. 5B lists the amino acid sequence of K77Q/K82Q expressed from pET-28a 
after cleavage with thrombin. 

15 [0067] Fig. 6 is a graphical representation of a calorimetric titration of Compound I 
with a His-tagged PINl PPIase. 

DETAILED DESCRIPTION OF THE INVENTION AND PREFERRED 
EMBODIMENTS 

[0068] As used herein, the terms "comprising" and "including" are used in an open, 
20 non-limiting sense. 

[0069] The present invention uses conventional microbiological and recombinant 
DNA techniques known to those of ordinary skill in the art, See, e.g., Sambrook et al., 
"Molecular Cloning: A Laboratory Manual," 3'"* ed. (2001) Cold Spring Harbor Press, 
Cold Spring Harbor, N.Y.; Glover, ed., "DNA Cloning: A Practical Approach," 

25 Volumes I and II, 2"^ (1 995), IRL Press, Oxford; Ausbel et al., eds. "Current 

Protocols in Molecular Biology" (1994) Green Publishers Inc. and Wiley and Sons, 
New York; Innis et al., eds."PCR Protocols: A Guide to Methods and Applications" 
(1990) Academic Press, San Diego; Freshney "Culture of Animal Cells: A Manual of 
Basic Technique," 4* ed.(2000) Wiley & Sons; and Perbal, "A Practical Guide to 

30 Molecular Cloning," 2"^ ed. (1988) Wiley & Sons. 
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A. Nucleic Acids and Polynucleotides 

[0070] The present invention provides isolated nucleic acid molecules that encode 
mutant PINl PPIases domains with improved crystallography properties. Such 
improved properties include the ability to bind ligands better than wild-type PINl in a 
5 crystallized form, and the ability to be crystallized without phosphate or sulfate. In 
the absence of phosphate or sulfate, the substrate-binding pocket is more amenable for 
compound binding. 

[0071] The terms "nucleic acid molecule" and "polynucleotide" are used 
interchangeably in this application. These terms refer to any polyribonucleotide or 

10 polydeoxribonucleotide, which may be unmodified RNA or DNA or modified RNA 
or DNA. These terms are intended to include DNA molecules (e.g., cDNA) and RNA 
molecules (e.g., mRNA) and analogs of the DNA or RNA generated using nucleotide 
analogs. Exemplary polynucleotides include single- and double-stranded DNA, DNA 
that is a mixture of single- and double-stranded regions or single-, double- and triple- 

15 stranded regions, single- and double-stranded RNA, and RNA that is mixture of 
single- and double-stranded regions, hybrid molecules comprising DNA and RNA 
that may be single-stranded or, double-stranded, or triple-stranded regions, or a 
mixture of single- and double-stranded regions. In addition, "polynucleotide" and 
"nucleic acid molecule" as used herein refer to triple-stranded regions composed of 

20 RNA or DNA, or both RNA and DNA. The strands in such regions may be from the 
same molecule or from different molecules. The regions may include all of one or 
more of the molecules, but more preferably involve only a region of some of the 
molecules. One of the molecules of a triple-helical region may be an oligonucleotide. 

[0072] Exemplary polynucleotides and nucleic acid molecules also include DNAs or 
25 RNAs as described above that contain one or more modified bases. Moreover, DNAs 
or RNAs comprising unusual bases, such as inosine, or modified bases, such as 
tritylated bases are exemplary polynucleotides. Exemplary polynucleotides and 
nucleic acid molecules also include chemically, enzymatically or metabolically 
modified forms of polynucleotides, as well as the chemical forms of DNA and RNA 
30 characteristic of viruses and cells, including, for example, simple and complex cells. 
Exemplary polynucleotides also include short polynucleotides referred to as 
oligonucleotides. 
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[0073] As used herein, the term "isolated" nucleic acid molecule means that the 
material is free of proteins and other nucleic acid present in the natural environment in 
which the material is normally found. In particular, the nucleic acid molecule is free 
of cellular components. Exemplary isolated nucleic acid molecules include PCR 
5 products, mRNA, cDNA, or restriction fragments. In another embodiment, an 
isolated nucleic acid is preferably excised from the chromosome in which it may be 
found, and more preferably is no longer joined to non-regulatory, non-coding regions, 
or to other genes, located upstream or downstream of the gene in its natural 
environment in the chromosome. In yet another embodiment^ the isolated nucleic 

10 acid lacks one or more introns. Isolated nucleic acid molecules can be inserted into 

I 

plasmids, cosmids, artificial chromosomes, and the like. Thus, in a specific 
embodiment, a recombinant nucleic acid is an isolated nucleic acid. Moreover, an 
"isolated" nucleic acid molecule, such as a cDNA molecule, can be substantially free 
of other cellular material, or culture medium when produced by recombinant 
15 techniques, or chemical precursors or other chemicals when chemically synthesized. 
However, the nucleic acid molecule can be fused to other coding or regulatory 
sequences and still be considered isolated. 

[0074] For example, a recombinant DNA molecule contained in a vector is 
considered isolated. Further examples of isolated DNA molecules include 
20 recombinant DNA molecules maintained in heterologous host cells or purified 
(partially or substantially) DNA molecules in solution. Exemplary isolated RNA 
molecules include in vivo or in vitro RNA transcripts of the isolated DNA molecules 
described herein. Exemplary isolated nucleic acid molecules further include such 
molecules produced synthetically. 

25 [0075] Full-length genes or portions thereof may be cloned using any one of a 

number of suitable methods known in the art. For example, a method that employs 
XL-PCR (Perkin-Elmer, Foster City, Calif.) to amplify long pieces of DNA may be 
used. 

[0076] The isolated nucleic acid molecules can encode functional polypeptides plus 
30 additional amino or carboxyl-terminal amino acids, such as those that, e.g., facilitate 
protein trafficking, prolong or shorten protein half-life, or facilitate manipulation of a 
protein for assay or production. Once a full-length gene is cloned, portions of the 
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gene, such as the PPIase domain, can be obtained using known techniques. The 
isolated nucleic acid molecules of the invention include the sequence encoding the 
active PPIase alone or in combination with other coding sequences, such as a leader 
or secretory sequence (e.g., a pre-pro or pro-protein sequence), the sequence encoding 
5 the PPIase domain, with or without the additional coding sequences, plus additional 
non-coding sequences, for example, introns and non-coding 5' and 3' sequences, such 
as transcribed but non-translated sequences that play a role in transcription, mRNA 
processing (including splicing and polyadenylation signals), ribosome binding, and 
stability of mRNA. In addition, the nucleic acid molecule may be fused to a marker 
10 sequence encoding, for example, a peptide that facilitates purification. 

[0077] Isolated nucleic acid molecules can be in the form of RNA, such as mRNA, or 
in the form of DNA, including cDNA and genomic DNA, obtained by cloning or 
produced by known chemical synthetic techniques or by a combination thereof. The 
nucleic acid, especially DNA, can be double-stranded or single-stranded. Single- 
15 stranded nucleic acid can be the coding strand (sense strand) or the non-coding strand 
(antisense strand). 

[0078] The invention further provides nucleic acid molecules that encode functional 
fragments or variants of PINl PPIases. Such nucleic acid molecules may be 
constructed by known recombinant DNA methods or by chemical synthesis. Such 
20 non-naturally occurring variants may be made by mutagenesis techniques, including 
those applied to nucleic acid molecules, cells, or organisms. Accordingly, the variants 
can contain nucleotide substitutions, deletions, inversions and insertions. Variation , 
can occur in either or both the coding and non-coding regions. The variations can 
produce both conservative and non-conservative amino acid substitutions. 

25 [0079] The nucleic acid molecules of the present invention are useful for producing 
peptides for use in crystallization studies, drug discovery, and drug design. The 
nucleic acid molecules can also be used as primers for PGR to amplify any given 
region of a nucleic acid molecule and are also useful to synthesize antisense 
molecules of desired length and sequence. 

30 [0080] The nucleic acid molecules are also useful for constructing recombinant 
vectors. Such vectors include expression vectors that express a portion of, or all of, 
the peptide sequences. Vectors also include insertion vectors, used to integrate into 
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another nucleic acid molecule sequence, such as into the cellular genome, to alter in 
situ expression of a gene and/or gene product. For example, an endogenous coding 
sequence can be replaced via homologous recombination with all or part of the coding 
region containing one or more specifically introduced mutations. 

5 [0081] The nucleic acid molecules are also useful for constructing host cells 
expressing a part, or all, of the nucleic acid molecules and peptides. 

Vectors and Host Cells 

[0082] The invention also provides vectors containing the nucleic acid molecules 
described herein. When the vector is a nucleic acid molecule, the nucleic acid 

10 molecules described herein are covalently linked to the vector nucleic acid. 

Exemplary vectors for this embodiment of the invention include plasmids, single- or 
double-stranded phage, single- or double-stranded RNA or DNA viral vector, or 
artificial chromosome, such as a BAC, PAC, YAC, or MAC. Various expression 
vectors can be used to express the polynucleotides of the invention, such as pET and 

15 pProEX. 

[0083] A vector can be maintained in the host cell as an extrachromosomal element 
where it replicates and produces additional copies of the nucleic acid molecules. 
Alternatively, the vector may integrate into the host cell genome and produce 
additional copies of the nucleic acid molecules when the host cell replicates. 

20 [0084] The vectors can be used for the maintenance (cloning vectors) or expression 
(expression vectors) of the nucleic acid molecules. The vectors can function in 
prokaryotic or eukaryotic cells or in both (shuttle vectors). 

[0085] Expression vectors contain cis-acting regulatory regions that are operably 
linked in the vector to the nucleic acid molecules such that transcription of the nucleic 

25 acid molecules is allowed in a host cell. The nucleic acid molecules can be 

introduced into the host cell with a separate nucleic acid molecule capable of affecting 
transcription. Thus, the second nucleic acid molecule may provide a trans-acting 
factor interacting with the cis-regulatory control region to allow transcription of the 
nucleic acid molecules from the vector. Alternatively, the host cell may supply a 

30 trans-acting factor. Finally, a trans-acting factor can be produced from the vector 
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itself. It is understood, however, that in some embodiments, transcription and/or 
translation of the nucleic acid molecules can occur in a cell-free system. 

[0086] Exemplary regulatory sequences to which the nucleic acid molecules 
described herein can be operably linked include promoters for directing mRNA 
5 transcription. These include the left promoter from bacteriophage X, the lac promoter, 
TRP, and TAC promoters from E. coli, the early and late promoters from SV40, the 
CMV immediate early promoter, the adenovirus early and late promoters, and 
retrovirus long-terminal repeats. 

[0087] The term "operably linked" as used herein indicates that a gene and a 
10 regulatory sequence, such as a promoter, are connected in such a way as to permit 
gene expression when the appropriate molecules (e.g., transcriptional activator 
proteins or proteins which include transcriptional activation domains) are bound to the 
regulatory sequence. 

[0088] In addition to control regions that promote transcription, exemplary expression 
15 vectors also include regions that modulate transcription, such as repressor binding 
sites and enhancers. Illustrative embodiments include the SV40 enhancer, the 
cytomegalovirus immediate early enhancer, polyoma enhancer, adenovirus enhancers, 
and retrovirus LTR enhancers. 

[0089] In addition to containing sites for transcription initiation and control, 
20 exemplary expression vectors can contain sequences necessary for transcription 

termination. These vectors may also contain signals necessary for translation such as 
a ribosome-binding site. Other exemplary regulatory control elements for expression 
include initiation and termination codons as well as polyadenylation signals. Other 
examples of regulatory sequences are described, for example, in Sambrook et al., 
25 2001, supra. 

[0090] A variety of expression vectors can be used to express a nucleic acid molecule. 
Examples of such vectors include chromosomal, episomal, and virus-derived vectors, 
for example, vectors derived from bacterial plasmids, from bacteriophage, from yeast 
episomes, from yeast chromosomal elements, including yeast artificial chromosomes, 
30 and from viruses such as baculoviruses, papovaviruses such as SV40, vaccinia 
viruses, adenoviruses, poxviruses, pseudorabies viruses, and retroviruses. Vectors 



I 
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may also be derived from combinations of these sources, such as those derived from 
plasmid and bacteriophage genetic elements, e.g., cosmids and phagemids. 
Appropriate cloning and expression vectors for prokaryotic and eukaryotic hosts are 
described in Sambrook et al., 2001, supra. 

5 [0091] The regulatory sequence may provide constitutive expression in one or more 
host cells (i.e. tissue specific) or may provide for inducible expression in one or more 
cell types such as by temperature, nutrient additive, or exogenous factor such as a 
hormone or other ligand. Suitable vectors providing for constitutive and inducible 
expression in prokaryotic and eukaryotic hosts are known in the art. 

10 [0092] The nucleic acid molecules can be inserted into the vector nucleic acid by 
known methodology. For example, the DNA of interest is joined to a vector by 
cleaving the DNA sequence and the vector with one or more restriction enzymes and 
then ligating the fragments together. 

[0093] The vector containing the appropriate nucleic acid molecule can be introduced 
15 into an appropriate host cell for propagation or expression using known techniques. 
Appropriate bacterial host cells include E. coli, Streptomyces, and Salmonella 
typhimurium. Appropriate eukaryotic host cells include yeast, insect cells, animal 
cells such as COS and CHO, and plant cells. 

[0094] In a preferred embodiment, a peptide as described herein is expressed as a 
20 fusion protein. Accordingly, the invention also provides fusion vectors that allow for 
the production of such peptides. Fusion vectors can increase the expression of a 
recombinant protein, increase the solubility of the recombinant protein, and/or aid in 
the purification of the protein by acting, for example, as a ligand for affinity 
purification. A proteolytic cleavage site may be introduced at the junction of the 
25 fusion moiety so that the desired peptide can ultimately be separated from the fusion 
moiety. Exemplary proteolytic enzymes include factor Xa, thrombin, and 
enterokinase. Illustrative fusion expression vectors include pGEX (Smith et al., Gene 
67:31-40 (1988)), pET28a (Novagen, Madison, WI), pMAL (New England Biolabs, 
Beverly, MA), and pRJT5 (Pharmacia, Piscataway, NJ), which fiise glutathione S- 
30 transferase (GST), maltose E binding protein, or protein A, respectively, to the target 
recombinant protein. Examples of suitable inducible non-fusion E. coli expression 
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vectors include pTrc (Amann et al., Gene 69:301-315 (1988)) and pET 1 Id (Studier et 
ah, Gene Expression Technology: Methods in Enzymology, 185:60-89 (1990)). 

[0095] Recombinant protein expression can be maximized in a host bacteria by 
providing a genetic background wherein the host cell has an impaired capacity to 
5 proteolytically cleave the recombinant protein. (Gottesman, Gene Expression 
Technology: Methods in Enzymology, 185:119-128(1990)). Alternatively, the 
sequence of the nucleic acid molecule of interest can be altered to provide preferential 
codon usage for a specific host cell, for example, E, coli. (Wada et aL, Nucleic Acids 
i?e5.20:2111-2118(1992)). 

10 [0096] The nucleic acid molecules can also be expressed by expression vectors that 
are operative in yeast. Examples of vectors for expression in yeast, e.g. 5. cerevisiae, 
include pYepSecl (Baldari, et al., EMBOJ. 6:229-234 (1987)), pMFa (Kurjan et al.. 
Cell 30:933-943 (1982)), pJRY88 (Schultz et al., Gene 54:1 13-123 (1987)), and 
pYES2 (Invitrogen Corporation, San Diego, CA). 

15 [0097] The nucleic acid molecules can also be expressed in insect cells using, for 
example, baculovirus expression vectors. Exemplary, baculovirus vectors available 
for expression of proteins in cultured insect cells (e.g., Sf 9 cells) include the pAc 
series (Smith et al., MoL Cell Biol. 3:2156-2165 (1983)) and the pVL series (Lucklow 
et al., Virology 170:31-39 (1989)). 

20 [0098] In a preferred embodiment of the invention, the nucleic acid molecules 
described herein are expressed in mammalian cells using mammalian expression 
vectors. Examples of mammalian expression vectors include pCDM8 (Seed, Nature 
329:840 (1987)) and pMT2PC (Kaufman et al., EMBOJ, 6:187-195 (1987)). 

[0099] Preferred expression vectors include pET28a (Novagen, Madison, WI), 
25 pAcSG2 (Pharmingen, San Diego, CA), pProEx (Life Technologies, Gaithersburg, 
MD) and pFastBac (Life Technologies). Other vectors suitable for maintenance 
propagation or expression of the nucleic acid molecules described herein are known in 
the art. For example, suitable vectors and methods for using and propagating vectors 
are discussed in Sambrook et ah, 2001, supra. 

30 [0100] The invention also relates to recombinant host cells containing the vectors 
described herein. Exemplary host cells include prokaryotic cells, lower eukaryotic 
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cells such as yeast, other eukaryotic cells such as insect cells, and higher eukaryotic 
cells such as mammalian cells. 

[0101] The recombinant host cells are prepared by introducing the vector constructs 
described herein into the cells by techniques available in the art. These include 
5 calcium phosphate transfection, DEAE-dextran-mediated transfection, cationic lipid- 
mediated transfection, electroporation, transduction, infection, lipofection. See also, 
Sambrook et al., 2001, supra. 

[0102] The recombinant host cells expressing the peptides described herein have a 
variety of uses. For example, the cells are useful for producing the polypeptides of 
10 the invention, which can be used for crystallography studies, biochemical studies, and 
drug discovery. 

[0103] Host cells can contain more than one vector. Thus, different nucleotide 
sequences can be introduced on different vectors of the same cell. Similarly, the 
nucleic acid molecules can be introduced either alone or with other nucleic acid 
15 molecules that are not related to the nucleic acid molecules, such as those providing 
trans-acting factors for expression vectors. When more than one vector is introduced 
into a cell, the vectors can be introduced independently, co-introduced, or joined to 
the PPIase polynucleotide vector. 

[0104] In the case of bacteriophage and viral vectors, these can be introduced into 
20 cells as packaged or encapsulated virus by standard procedures for infection and 

transduction. Viral vectors can be replication-competent or replication-defective. In 
the case in which viral replication is defective, replication will occur in host cells 
providing functions that complement the defects. 

[0105] Exemplary vectors include selectable markers that enable the selection of the 
25 subpopulation of cells that contain the recombinant vector constructs. The marker can 
be contained in the same vector that contains the nucleic acid molecules described 
herein or may be on a separate vector. Exemplary markers include tetracycline or 
ampicillin-resistance genes for prokaryotic host cells, and dihydrofolate reductase or 
neomycin resistance for eukaryotic host cells. However, any marker that provides 
30 selection for a phenotypic trait may be used. 

B. Peptides, Proteins and Antibodies 
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[0106] The following amino acid abbreviations are used herine: A=Ala=Alanine; 
V^Val=Valine; L=Leu=Leucine; I=Ile=Isoleucine; P=Pro=Proline; 
F=Phe=Phenylalanine; W=Trp-Tryptophan; M=Met=Methionine; G=Gly=Glycine; 
S=Ser=Serine; T=Thr=Threonine; C=Cys=<^ysteine; Y=Tyr^Tyrosine; 
5 N=Asn=Asparagine; Q=Gln=Glutamine; D=Asp=Aspartic Acid; E=Glu=Glutamic 
Acid; K=Lys=Lysine; R=Arg=Arginine; and H=His=Histidine. 

[0107] As used herein, the terms "peptidyl-prolyl isomease" and "PPIase" refer to 
enzymes that accelerate the cis/tram isomerization of peptide bonds preceding prolyl 
residues. 

10 [0108] The term "mutant PINl PPIase" means a polypeptide which contains a PINl ^ 
PPIase domain but which is devoid of the PINl WW domain. These mutant PINl 
PPIase polypeptides may also contain discrete amino acid substitutions in their PPIase 
domain. 

[0109] "Polypeptide" refers to any peptide or protein comprising two or more amino 
15 acids joined to each other by peptide bonds or modified peptide bonds, i.e., peptide 
isosteres. "Polypeptide" refers to both short chains, commonly referred to as peptides, 
oligopeptides or oligomers, and to longer chains, generally referred to as proteins. 
The terms "peptide", "polypeptide" and "protein" are used interchangeably herein. 

[0110] As used herein, a peptide is said to be "isolated" or "purified" when it is 
20 substantially free of homologous cellular material or chemical precursors or other 
chemicals. The peptides of the present invention can be purified to homogeneity or 
other degrees of purity. The level of purification will be selected based on the 
intended use, such that the preparation allows for the desired function of the peptide, 
even if in the presence of considerable amounts of other components. 

25 ' [01 1 1] In some embodiments, "substantially free of cellular material" means 

preparations of the peptide having less than about 30% (by dry weight) other proteins 
(i.e., contaminating protein). In preferred embodiments the peptide preparation 
contains less than about 20% other proteins, more preferably less than about 10% 
other proteins, or even more preferably less than about 5% other proteins. When the 

30 peptide is recombinantly produced, it can also be substantially free of culture medium, 
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i.e., culture medium represents less than about 20% of the volume of the protein 
preparation. 

[0112] The language "substantially free of chemical precursors or other chemicals" 
refers to preparations of the peptide in which it is separated from chemical precursors 
5 or other chemicals that are involved in its synthesis. The term "substantially free of 
chemical precursors or other chemicals" means preparations of the mutant PINl 
PPIase polypeptides having less than about 30% (by dry weight) chemical precursors 
or other chemicals. In preferred embodiments the peptide preparations have less than 
about 20% chemical precursors or other chemicals, more preferably less than about 
10 .10% chemical precursors or other chemicals, or even more preferably less than about 
5% chemical precursors or other chemicals. 

[0113] The isolated mutant PPIase polypeptides described herein can be purified from 
cells that have been altered to express it (recombination), or synthesized using known 
protein synthesis techniques. For example, a nucleic acid molecule encoding the 
15 PPIase polypeptide is cloned into an expression vector, the expression vector 

introduced into a host cell and the protein expressed in the host cell. The protein can 
then be isolated from the cells by an appropriate purification scheme using standard 
protein purification techniques. ' 

[01 14] While the polypeptides of the invention can be produced in bacteria, yeast, 
20 mammalian cells, and other cells under the control of the appropriate regulatory 

sequences, cell-free transcription and translation systems can also be used to produce 
these proteins using RNA derived from the DNA^constructs described herein. 

[0115] Where secretion of the peptide is desired, appropriate secretion signals are 
incorporated into the vector. The signal sequence can be endogenous to the peptides 
25 or heterologous to these peptides. 

[01 16] It is also understood that, depending upon the host cell in recombinant 
production of the peptides described herein, the peptides can have various 
glycosylation patterns, depending upon the cell, or non-glycosylated, as when 
produced in bacteria. In some embodiments, the peptides may include an initial 
30 modified methionine as a result of a host-mediated process. 
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[0117] The present invention also provides variants of the above-described peptides, 
such as allelic/sequence variants of the peptides, and non-naturally occurring 
recombinantly derived variants of the peptides. Such variants can be generated using 
techniques that are known by those skilled in the fields of recombinant nucleic acid 
5 technology and protein biochemistry. 

[01 1 8] Such variants can readily be made or identified using molecular techniques 
and the sequence information disclosed herein. Further, such variants can readily be 
distinguished from other peptides based on sequence and/or structural homology to 
the peptides of the present invention. 

10 [0119] To determine the percent identity of two amino acid sequences or two nucleic 
acid sequences, the sequences are aligned for optimal comparison purposes (e.g., gaps 
can be introduced in one or both of a first and a second amino acid or nucleic acid 
sequence for optimal alignment and non-homologous sequences can be disregarded 
for comparison purposes). In a preferred embodiment, the length of a reference 

15 sequence aligned for comparison purposes is at least 30%, preferably 40%, more 
preferably 50%, even more preferably 60% or more, of the length of the reference 
sequence. In a preferred embodiment, the length of a reference sequence aligned for 
comparison purposes is at least 70%, preferably 80%, more preferably 90% or more, 
of the length of the reference sequence. The amino acid residues or nucleotides at 

20 corresponding amino acid positions or nucleotide positions are then compared. When 
a position in the first sequence is occupied by the same amino acid residue or 
nucleotide as the corresponding position in the second sequence, then the molecules 
are identical at that position (as used herein amino acid or nucleic acid "identity" is 
equivalent to amino acid or nucleic acid "homology"). The percent identity between 

25 the two sequences is a function of the number of identical positions shared by the 
sequences, taking into account the number of gaps, and the length of each gap, which 
need to be introduced for optimal alignment of the two sequences. 

[0120] The comparison of sequences and determination of percent identity and 
similarity between two sequences can be accomplished using a mathematical 
30 algorithm. (Lesk, ed., "Computational Molecular Biology" (1988) Oxford University 
Press, New York; Smith, ed., "Biocomputing: Informatics and Genome Projects" 
(1993) Acadeniic Press, New York ; Griffin et al., eds., "Computer Analysis of 
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Sequence Data, Part 1" (1994) Humana Press, New Jersey; von Heinje, "Sequence 
Analysis in Molecular Biology" (1987) Academic Press; and Gribskov et al. eds., 
"Sequence Analysis Primer" (1991) Stockton Press, New York). For example, the 
percent identity between two amino acid sequences is determined using the 
5 Needleman et al. algorithm (/. MoL Biol 48:444-453 (1970), which has been 
incorporated into commercially available computer programs, such as GAP in the 
GCG software package, using either a Blossom 62 matrix or a PAM250 matrix, and a 
gap weight of 16, 14, 12, 10, 8, 6, or 4 and a length weight of 1, 2, 3, 4, 5, or 6. The 
percent identity between two nucleotide sequences can also be determined using the 

10 commercially available computer programs including the GAP program in the GCG 
software package (Devereux et al., Nucleic Acids Res, 12(1):387 (1984)), the NWS 
gap DNA CMP matrix and a gap weight of 40, 50, 60, 70, or 80 and a length weight 
of 1, 2, 3, 4, 5, or 6. The percent identity between two amino acid or nucleotide 
sequences can be determined using the algorithm of Meyers et al. (CABIOS, 4:1 1-17 

15 (1 989)), which has been incorporated into commercially available computer 

programs, such as ALIGN (version 2.0), using a PAM120 weight residue table, a gap 
length penalty of 1 2 and a gap penalty of 4. 

[0121] The nucleic acid and protein sequences of the present invention can fiirther be 
used as a "query sequence" to perform a search against sequence databases to, for 

20 example, identify other family members or related sequences. Such searches can be 
performed using commercially available search engines, such as the NBLAST and 
XBLAST programs (version 2.0) of Altschul et al. (J. MoL Biol 215:403-10 (1990)). 
Nucleotide searches can be performed with such programs to obtain nucleotide 
sequences homologous to the nucleic acid molecules of the invention. Protein 

25 searches can be performed with such programs to obtain amino acid sequences 
homologous to the proteins of the invention. To obtain gapped alignments for 
comparison purposes, Gapped BLAST can be utilized as described in Altschul et al. 
{Nucleic Acids Res. 25(17):3389-3402 (1997)). 

[0122] Peptides can be routinely identified as having a high degree (significant) of 
30 sequence homology/identity to the peptides of the present invention. As used herein, 
two proteins (or a region of the proteins) have "significant homology" when the . 
amino acid sequences are typically at least about 70-75% homologous. In preferred 
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embodiments, the homology is 80-85%, and more preferably at least about 90-95%. 
A significantly homologous amino acid sequence will be encoded by a nucleic acid 
sequence that will hybridize to a peptide encoding nucleic acid molecule under 
stringent conditions. 

5 [01 23] Non-naturally occurring variants of the polypeptides of the present invention 
can be generated using recombinant techniques. Such variants include deletions, 
additions and substitutions in the amino acid sequence of the PPIase domain. For 
example, one class of substitutions are conservative amino acid substitutions. Such 
substitutions are those that substitute a given amino acid in a peptide by another 

10 amino acid of like characteristics. Exemplary conservative substitutions are the 
replacements, one for another, among the aliphatic amino acids (Ala, Val, Leu, and 
lie); interchange of amino acids containing a hydroxyl residue (Ser and Thr); 
exchange of amino acids containing an acidic residue (Asp and Glu); substitution 
between amino acids containing an amide residue (Asn and Gin); exchange of amino 

15 acids containing a basic residue (Lys and Arg); and replacements among amino acids 
containing an aromatic residue (Phe, Tyr). Guidance concerning which amino acid 
changes are likely to be phenotypically silent is found in Bowie et al.. Science 
247:1306-1310(1990). ' 

[0124] Variant PINl PPIases can be fully functional or may have reduced or 
20 decreased activity when compared to;;.the wild-typb protein. Fully functional variants. . 
may contain conservative variation or variation in non-critical residues or in noil- 
critical regions. Functional variants can also contain substitution of similar amino 
acids, not affecting function that result in no change or an insignificant change in 
function. Alternatively, such substitutions may positively or negatively affect 
25 function to some degree. 

[0125] Exemplary non-functional variants are those having one or more non- 
conservative amino acid substitutions, deletions, insertions, inversions, or truncations 
of the particular polypeptide, or a substitution, insertion, inversion, or deletion in a 
critical residue or critical region of the polypeptide. 

30 [0126] Amino acids that affect function can be identified by methods known in the 
art, such as site-directed mutagenesis or alanine-scanning mutagenesis (Cunningham 
et al., 1989, Science 244:1081-1085). The latter procedure introduces single alanine 
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mutations at every residue in the molecule. The resulting mutant molecules are then 
tested for biological activity, for example, by measuring enzymatic activity. Sites that 
are critical for binding can also be determined by structural analysis, such as by X-ray 
crystallography, nuclear magnetic resonance, or photoaffinity labeling (Smith et al., J. 
5 Mol Biol 224:899-904 (1992); de Vos et al.. Science 255:306-312 (1992)). 

Accordingly, the peptides of the present invention also include derivatives or analogs: 
in which a substituted amino acid residue is not one encoded by the genetic code; in 
which a substituent group is included; in which the polypeptide is fused with another 
compound, such as a compound to increase the half-life of the polypeptide (for 
10 example, polyethylene glycol); or in which the additional amino acids are fiised to the 
polypeptide, such as a leader or secretory sequence or a sequence for purification of 
the polypeptide. 

[0127] The present invention further provides for functional, active fragments of the 
PINl PPIase domain. A "fragment" is a variant polypeptide having an amino acid 

15 sequence that is entirely the same as part but not all of any amino acid sequence of 
any polypeptide of the invention. As with the mutant PINl polypeptides of the 
invention, fragments may be free-standing or comprised within a larger polypeptide of 
which they form a part or region; most preferably they are a single continuous region 
in a single larger polypeptide. As used herein, a "fragment" comprises at least 8 or 

20 more contiguous amino acid residues from the protein PPIase domain. Such 

fragments can be chosen based on the ability to retain the biological activity of the 
PPIase domain or based on the ability to perform a function, e.g., act as an 
immunogen. Preferred are fragments that are catalytically active and that have 
improved crystallography properties as compared to full-length wild-type PINl . Such 

25 fragments will preferably comprise a domain or motif of the PPIase, e.g., active site or 
binding site. 

[0128] Polypeptides may contain amino acids other than the 20 amino acids 
commonly referred to as the 20 naturally occurring amino acids. Further, many 
amino acids, including the terminal amino acids, may be modified by natural 
30 processes, such as byprocessing and other post-translational modifications, or by 
chemical modification techniques known in the art. Known modifications include 
acetylation, acylation, ADP-ribosylation, amidation, covalent attachment of flavin, 
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covalent attachment of a heme moiety, covalent attachment of a nucleotide or 
nucleotide derivative, covalent attachment of a lipid or lipid derivative, covalent 
attachment of phosphotidylinositol, cross-linking, cyclization, disulfide bond 
formation, demethylation, formation of covalent crosslinks, formation of cystine, 
5 formation of pyroglutamate, formylation, gamma carboxylation, glycosylation, GPI 
anchor formation, hydroxylation, iodination, methylation, myristoylation, oxidation, 
proteolytic processing, phosphorylation, phenylation, racemization, selenoylation, 
sulfation, transfer-RNA mediated addition of amino acids to proteins such as 
arginylation, and ubiquitination. Modifications, such as glycosylation, lipid 

10 attachment, sulfation, gamma-carboxylation of glutamic acid; residues, hydroxylation 
and ADP-ribosylation, for instance, are described in most ba^ic texts, such as 
Creighton, "Proteins-Structure and Molecular Properties," 2nd ed. (1993) W. H, 
Freeman and Company, New York. Reviews on this subject include Wold, 
"Posttranslational Covalent Modification of Proteins," Johnson, ed., Academic Press, 

15 New York 1-12 (1983); Seifter et al. {Meth EnzymoL 182: 626-646 (1990)); and 
Rattan et al. (Ann. N. Y. Acad ScL 663:48-62 (1992)). 

[01 29] In some embodiments, the peptides can be attached to heterologous sequences 
to form chimeric or fusion proteins. Such chimeric and fusion proteins comprise a 
peptide operatively linked to a heterologous protein having an amino acid sequence 

20 not substantially homologous to the PPIase peptide. "Operatively linked" indicates 
that the peptide and the heterologous protein are fused in-frame. The heterologous 
protein can be fused to the N-terminus or C-terminus of the PPIase peptide. The two 
peptides linked in a fusion peptide are preferrably derived from two independent 
sources, and therefore such a fusion peptide comprises two linked peptides not 

25 normally found linked in nature. 

[0130] In some embodiments, the fusion protein does not affect the activity of the 
peptide perse. For example, the fiision protein can include, enzymatic fusion proteins 
or affinity tags, for example, beta-galactosidase fusions, yeast two-hybrid GAL 
fusions, His-tags, MYC-tags, green fusion protein, and Ig fusions. Such fusion 
30 proteins can facilitate the purification of the polypeptides described herein. In certain 
host cells (e.g., mammalian host cells), expression and/or secretion of a protein can be 
increased by using a heterologous signal sequence. 
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[0131] A chimeric or fusion protein can be produced by standard recombinant DNA 
techniques. For example, DNA fragments coding for the diflferent protein sequences 
are ligated together in-frame in accordance with conventional techniques. In another 
embodiment, the fusion gene can be synthesized by conventional techniques, 

5 including automated DNA synthesizers. Alternatively, PGR amplification of gene 
fragments can be carried out using anchor primers which give rise to complementary 
overhangs, between two consecutive gene fragments, which can subsequently be 
annealed and re-amplified to generate a chimeric gene sequence (see Ausubel et al., 
1992 supra). Moreover, many expression vectors are commercially available that 

10 already encode a fusion moiety (e.g., a GST protein, His-tag, or green fluorescent 
protein). A nucleic acid encoding a PPIase polypeptide can be cloned into such an 
expression vector such that the fusion moiety is linked in-frame to the PPIase 
polypeptide. 

[0132] The polypeptides can be used for rapid-screening methods (high-throughput 
15 screening) to identify compounds that inhibit or modulate PINl PPIase activity. The 
high-throughput screening assay can be fiilly automated on robotic workstations. The 
assay may employ radioactivity, fluorescence, or other materials useful for detection. 

[0133] "High-throughput screening" as used herein refers to an assay that provides 
for multiple-candidate agents or samples to be screened simultaneously. Preferably 
20 the number of agents or samples screened is greater than one, more preferably greater 
than 100, and even more preferably greater than 300. Such assays may include the 
use of microtiter plates or other vessel containing apparatus that allows a large 
number of assays to be carried out simultaneously, usmg small amounts of reagents 
and samples. 

25 C. Crystallization and Drug Design 

r 

[0134] Crystals of the polypeptides of the invention or ligand complexes of such 
polypeptides can be grown by a number of known techniques, including batch 
crystallization, vapor diffrision (either by sitting drop or hanging drop), and 
microdialysis. Seeding of the crystals in some instances is required to obtain X-ray 
30 quality crystals. Standard micro and/or macro seeding of crystals may therefore be 
used. As exemplified below, PINl PPIase-Compound I complex was prepared by 
diluting PINl PPIase to 10 mg/ml, then exposing it to Compound I dissolved in 100% 
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DMSO to a final concentration of ImM. The resulting protein/Compound I solution 
was then incubated for 24 hours at 4^C, and filtered through a 0.45-|j.M cellulose- 
acetate membrane prior to setting up crystallization experiments. Under these 
conditions, crystals grew within 3 days. 

5 [0135] Once a crystal of the present invention is grown, X-ray diffraction data can be 
collected. X-ray diffraction data collection can be obtained using, for example, an 
MAR-imaging plate detector. Crystals can be characterized by using X-rays produced 
in a conventional source (such as a sealed tube or a rotating anode) or using a 
synchrotron source (provided by, e.g., the Stanford University Synchrotron Radiation 
10 Laboratory). 

[0136] Data processing and reduction can be carried out using programs such as 
DENZO/SCALEPACK (HKL Research, Inc., Charlottesvilee, VA; Otwinowski et al., 
Metk EnzymoL 276:307-326 (1997)). In addition, X-PLOR (Brunger, "X-PLOR:A 
System for X-ray Crystallography and NMR," Yale University Press, New Haven, 

15 Conn (1992)) or Heavy (Terwilliger, Los Alamos National Laboratory) may be 

utilized for bulk solvent correction and B-factor scaling. Electron density maps can 
be calculated using SHARP (La Fortelle et al., Meth. Enzymol. 276:472-494 (1997)) . 
and SOLOMON (Abrahams et al., Acta Cryst, D52:30-42 (1996)). Molecular models 
can be built into this map using O (Jones et al., ACTA Crystallogr. A47:l 10-1 19 

20 (1991)), XTALVIEW (Scripps Research, La JoUa, CA) or QUANTA98 (Accelrys, 
Inc. San Diego, CA). Refinement can be done using X-PLOR (Brunger, 1992, 
supra,), using the free R-value to monitor the course of refinement. 

[0137] Once the three-dimensional structure of a crystal comprising a PINl PPIase or 
a PINl PPIase-complex is determined, a potential ligand (antagonist or agonist) is 

25 examined through the use of computer modeling using a docking program such as 
FelxiDock (Tripos, St. Louis, MO), GRAM (Medical Univ. Of South Carolina), 
DOCK (Univ. of California at San Francisco), Glide (Schrodinger, Portland, OR), 
Gold (Cambridge Crystallographic Data Centre, UK), FlexX (BioSolvelT GmbH, 
Germany); AGDOCK (Gehlhaar et al., Chemistry & Biol 2:3 1 7-324 (1 995); Bouzida 

30 et al., Pacific Symp. on Biocomputing '99, 426-437 (1 999); Bouzida et al., Internat J 
of Quantum Chem. 72:73-84 (1999); Gehlhaar et al., Proceedings of the Seventh Ann, 
Conf on Evolutionary Programming, The MIT Press, Cambridge, MA (1998); Hex 
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(Ritchie et al., Proteins: Struct. Funct. & Genet, 39:178-194 (2000); all incorporated 
herein by refemce), or AUTODOCK (Scripps Research Institute, La Jolla, CA). This 
modeling procedure can include computer fitting of potential ligands to the PPIase 
substrate-binding domain to ascertain how well the shape and the chemical structure 
5 of the potential ligand will complement or interfere with the PPIase substrate-binding 
domain (Bugg et al., Scientific American Dec.:92-98 (1993); West et al., TIPS, 16:67- 
74 (1995)). 

[0138] Computer programs can also be employed to estimate the attraction, repulsion, 
and steric hindrance of the ligand to the PPIase-binding domain. For example, one 

10 can screen computationally small molecule databases for chemical entities or 

compounds that can bind in whole, or in part, to PIN l PPIase. In this screening, the 
quality of fit of such entities or compounds to the binding site may be judged either by 
shape complementarity or by estimated interaction energy (Meng, et al., J. Comp. 
Chem., 13:505-524 (1992)). Generally, the tighter the fit (e.g., the lower the steric ' 

15 hindrance and/or the greater the attractive force), the more potent the drug is projected 
to be since these properties are consistent with a tighter-binding constant. 

[0139] "Binding domain," also referred to as "binding site," "binding pocket," 
"substrate-binding site," "catalytic domain," or "substrate-binding domain," refers to 
a region or regions of a molecule or molecular complex, that, as a result of its shape, 

20 can associate with another chemical entity or compound. Such regions are of utility 
in fields such as drug discovery. The association of natural ligands or substrates with 
binding pockets of their corresponding receptors or enzymes is the basis of many 
biological mechanisms of action. Similarly, many drugs exert their biological effects 
via an interaction with the binding pockets of a receptor or enzyme. Such interactions 

25 may occur with all or part of the binding pocket. An understanding of such 

interactions can facilitate the design of drugs having more favorable and specific 
interactions with their target receptor or enzyme and, thus, improved biological 
effects. Therefore, information related to ligand binding with the PINl substrate- 
binding site is valuable in facilitating the design and discovery of modulators of PINl. 

30 Furthermore, the more specificity in the design of a potential drug the more likely that 
the drug will not interact with other similar proteins, thus minimizing potential side 
effects due to unwanted cross interactions. 
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[0140] Initially, a potential ligand can be obtained by screening a random chemical 
library. A ligand selected in this manner could be then be systematically modified by 
computer-modeling programs until one or more promising potential ligands are 
identified. Such analysis has been shown to be useful in the design of, for example, 
5 fflV protease inhibitors (Lam et al.. Science 263:380-384 (1994); Wlodawer et al., 
Ann, Rev. Biochem. 62:543.-585 (1993); Appelt, Perspectives in Drug Discovery and 
Design 1:23-48 (1993); Erickson, Perspectives in Drug Discovery and Design 1: 109- 
128 (1993). Additionally, directed or focused libraries can be constructed as a means 
of modifying compounds previously identified as ligands from screening a random 
10 chemical library. Using this method, a number of different compounds can be 

synthesized that systematically explore a particular portion of the ligand-binding site 
and then tested for activity against the protein of interest. For example, in compound 
I, the phenyl group could be replaced with substituents that have different physical 
and chemical properties than the phenyl group. 

15 [0141] Such computer modeling allows the selection of a finite number of rational 
chemical modifications, as opposed to the potentially unlimited number of essentially 
random chemical modifications that could be made, any one of which might lead to a 
drug. Each chemical modification requires additional chemical steps, which, while 
being reasonable for the synthesis of a finite number of compounds, quickly becomes 

20 overwhelming if all possible modifications needed to be synthesized. Thus, through 
the use of the structure coordinates disclosed herein and computer modeling, a large 
number of these compounds can be rapidly modeled via a computer, and a few 
promising candidates can be determined without the laborious synthesis of a 
multitude of compounds. 

25 [0142] Once a potential ligand (agonist or antagonist) is identified, it can be either 
selected from commercial libraries of compounds or alternatively the potential ligand 
may be synthesized de novo. The prospective drug can be tested in the binding assay 
exemplified below to test its ability to bind to the PPIase substrate-binding domain, or 
it can be tested for its ability to modulate PINl PPIase activity. 

30 [0143] The term "modulates" refers to the ability of a compound to alter the function 
of a peptidyl-prolyl isomerase, such as PINl. For example, a compound modulates 
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the activity of a peptidyl-prolyl isomerase if it either increases or decreases the 
peptidyl-prolyl isomerase activity of the peptidyl-prolyl isomerase protein. 

[0144] When a suitable compound is identified, a supplemental crystal can be grown 
that comprises a protein-ligand complex formed between the PINl PPlase domain and 
5 the compound. Preferably, the crystal effectively diffracts X-rays allowing the 

determination of the atomic coordinates of the protein-ligand complex to a resolution 
of greater than or equal to 3,0 A, more preferably greater than or equal to 2,0 A. 
Molecular Replacement Analysis can be used to determine the three-dimensional 
structure of the supplemental crystal. 

10 [0145] Molecular replacement involves using a known three-dimensional structure as 
a search model to determine the structure of an identical or closely related molecule or 
protein-ligand complex in a new crystal form. The measured X-ray difiraction 
properties of the new crystal are compared with those calculated from the search 
model structure to compute the position and orientation of the protein in the new 

15 crystal. Computer programs that can be used for this purpose include: X-PLOR 
(Brunger, 1992, supra, EPMR (Kissinger et al. Acta CrysL D55:484-491 (1999); 
incorporated herein by refemce), ProLSQ (Konnert et al., Acta Cryst, A36:344-350 
(1 980)), and AMORE (J. Navaza, Acta Crystallographies ASO, 157-163 (1994)). 
Once the position and orientation are known, an electron density map can be 

20 calculated using the search model to provide X-ray phases. Thereafter, the electron 
density is inspected for structural differences and the search model is modified to 
conform to the new structure. Using this approach, the structure may be used to solve 
the three-dimensional structures of any such PINl PPlase polypeptide-ligand 
complex. Other computer programs that can be used to solve the structures of such 

25 PINl PPlase crystals include QUANTA (Accelrys, Inc., San Diego, CA), INSIGHT 
(Accelrys, Inc., San Diego, CA), ARP/wARP (European Molecular Biology 
Laboratory, Heidelberg, Germany; Perrakis et al., Nature Struc. Biol. 6:458-463 
(1999); Lamzin et al.. Acta Crv5/.D49: 129-147 (1993)), and ICM (MolSoft, La Jolla, 
CA) 

30 [0146] For all of the drug design strategies described herein, successive iterations of 
any and/or all of the steps provided by the aforementioned procedures are typically 
performed to yield one or more ligands with improved properties (e.g., activity). 
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[0147] Another aspect of the invention involves using the structure coordinates 
generated from the PPIase-ligand complex to generate a three-dimensional shape. 
This is achieved through the use of commercially available software that is capable of 
generating three-dimensional graphical representations of molecules or portions 
5 thereoffrom a set of structure coordinates. 

[0148] In resolving the crystal structure of a mutant PINl PPIase polypeptide as 
described below, the PINl amino acids that define the shape of the PINl PPIase 
substrate-binding domain were determined. For example, one component of the 
PPIase substrate-binding domain is the surface formed by amino acids Leu61, 

10 Cysll3, Serl 14, SerllS, Alall6, Lysll7, AlallS, Argll9, Glyl20, Aspl21, Leul22, 
Glyl23, Alal24, Phel25, Serl26, Argl27, Glyl28, Glnl29, and Metl30. These 
residues play a part in binding (hydrophobic interaction). Arg54, Lysl 17, and Glnl29 
can also form electrostatic interactions with entities that bind in the PINl PPIase 
substrate-binding site. Arg54, Arg56, Serl 1 1, Lysl32, and Asp 1 53, although slightly 

15 away from the direct ligand interaction, could interact with niodified or larger ligands. 
Additionally, the prolyl pocket includes His59, Leul22, Phel34, Metl30, Hisl57, 
Thrl52, Serl54, Glnl31, and Cysll3. Lys63, Ser67, Arg68 and Arg69 are relevant 
to electrostatic interactions. The interaction of Lys63 and Ser67 can be direct or 
indirect, such as with water mediation. Further, the crystal structure indicates a 

20 Glnl3I pocket; with potential mterac|ipn to Glnl31, Thrl52:, GluU5, and Prol33. 
Still further, a Trp73 pocket is formed by amino acids Arg69, Serl l4, Ser72, Trp73, 
Aspl 12 and Alal 16. There is a potential covalent adduct to Cysl 13. Thus, a binding 
pocket defined by the structural coordinates of these amino acids, as set forth in Table 
III, or a binding pocket whose root-mean-square deviation from the structure 

25 coordinates of the backbone atoms of these amino acids that is not more than about 
0.5 A, is a PINl PPIase or PPIase-like substrate-binding domain of this invention. 
Depictions of the PINl PPIase substrate-binding site are shovm in Figs. 1-3. 

[0149] It will be readily apparent to those of skill in the art that the numbering of 
amino acids in other isoforms of PINl may be different than that set forth herein. 
30 Corresponding amino acids in other isoforms of PINl are readily identified by 

inspection of the amino acid sequences, for example, through the use of commercially 
available homology software programs. 



PC19084/AG0116-US 

38 

[0150] The amino acids of the PPIase domain of the polypeptides of the invention are 
described herein in reference to the set of structure coordinates set forth in Tables II 
and HI. The terms "structure coordinates" and "atomic coordinates" refer to Cartesian 
coordinates derived from mathematical equations related to the patterns obtained on 
5 diffraction of a monochromatic beam of X-rays by the atoms (scattering centers) of a 
protein or protein-ligand complex in crystal form. The diffraction data are used to 
calculate an electron density map of the repeating unit of the crystal. The electron 
density maps are then used to establish the positions of the individual atoms of the 
enzyme or enzyme complex. 

10 [0151] The variations in coordinates discussed above may be generated because of 
mathematical manipulations of the PINl PPIase-Compound I complex structure 
coordinates. For example, the structure coordinates set forth in Table III may be 
manipulated by crystallographic permutations of the structure coordinates, 
fractionalization of the structure coordinates, integer additions, subtractions to sets of 

15 the structure coordinates, coordinate transformations, e.g., translation or rotation, or 
combinations thereof 

[0152] Alternatively, modifications in the crystal structure due to mutations, 
additions, substitutions, and/or deletions of amino acids, or other changes in any of 
the components that make up the crystal may also account for variations in structure 

20 coordinates. If such variations are within an acceptable standard error as compared to 
the original coordinates, the resulting three-dimensional shape is considered to be the 
same. Thus, for example, a ligand that has bound to the binding pocket of the mutant 
PPIase domain would also be expected to bind to another binding pocket whose 
structure coordinates, when compared to those described, have a root-mean-square 

25 difference of equal to or less than about 0.5 A from the backbone atoms. 

[0153] Various computational analyses can be performed to determine whether a 
polypeptide or the binding pocket portion thereof is sufficiently similar to the PPIase 
binding pocket as described herein. Such analyses may be carried out through the use 
of known software applications, such as the MODELLER module of INSIGHT II 
30 (Accebys, Inc., San Diego, CA), ProMod (University of Geneva, Switzerland), 
SWISS-MODEL (Swiss Institute of Bioinformatics), and the Molecular Similarity 
application of QUANTA (Accelrys, Inc., San Diego, CA). 
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[0154] Programs such as QUANTA (Accelrys, Inc., San Diego, CA), INSIGHT U 
(Accelrys, Inc., San Diego, CA), Maestro (SchrOdinger, Portland, OR ), SYBYL 
(Tripos, Inc., St. Louis, MO), and MacroModel (Schrodinger, Portland, OR) permit 
comparisons between different structures, different conformations of the same 
5 structure, and different parts of the same structure. Comparison of structures using 
such computer software may involve the following steps: 1) loading the structures to 
be compared; 2) defining the atom equivalencies in the structures; 3) performing a 
fitting operation; and 4) analyzing the results. 

[0155] In comparing structures, each structure is identified by a name. One structure 
10 is identified as the target (i.e., the fixed structure); all remaining structures are 

working structures (i.e., moving structures). Since atom equivalency with QUANTA 
is defined by user input, as defined herein "equivalent atoms" refers to protein 
backbone atoms (N, Ca, C, and O) for all conserved residues between the two 
structures being compared. 

15 [0156] When a rigid-fitting method is used, the working structure is translated and 
rotated to obtain an optimum fit with the target structure. The fitting operation uses 
an algorithm that computes the optimum translation and rotation to be applied to the 
moving structure, such that the root-mean-square difference of the fit over the 
specified pairs of equivalent atoms is an absolute minimum. This number, given in 

20 angstroms (A), is reported by software applications such as QUANTA (Accelrys, Inc., 
San Diego, CA) or other similar programs. Any molecule or molecular complex or 
binding pocket thereof that has a root-mean-square deviation of conserved residue 
backbone atoms (N, Ca, C, O) of less than about 0.5 A when superimposed on the 
relevant backbone atoms described by structure coordinates listed in Table III are 

25 considered identical. 

[0157] The term "root-mean-square deviation" means the square root of the 
arithmetic mean of the squares of the deviations from the mean. It is a way to express 
the deviation or variation from a trend or object. As used herein, the "root-mean- 
square deviation" defines the variation in the backbone of a protein from the 
30 backbone of the PINl PPIase polypeptides of the invention or the PINl PPIase 
substrate-binding domain portion thereof, as defined by the structure coordinates 
described herein. 
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D. Computers, Computer Software, Computer Modeling 

[0158] As discussed above, a computer may be used for producing a three- 
dimensional representation of the PPIase substrate-binding domain. Suitable 
computers are known in the art and typically include a central processing unit (CPU), 
5 and a working memory, which can be random-access memory, core memory, mass- 
storage memory, or a combination thereof. The CPU may encode one or more 
programs. Computers also typically include display, input and output devices, such as 
one or more cathode-ray tube display terminals, keyboards, modems, input lines and 
output lines. Further, computers may be networked to computer servers (the machine 
10 on which large calculations can be run in batch) and file servers (the main machine 
for all the centralized databases). 

[0159] Machine-readable media containing data, such as the crystal structure 
coordinates of the polypeptides, may be inputted using various hardware, including 
modems, CD-ROM drives, disk drives, or keyboards. 

15 [0160] Machine-readable data medium can be, for example, a floppy diskette, hard 
disk, or an optically-readable readable data storage medium, which can be either read 
only memory, or rewritable, such as a magneto-optical disk. 

[0161] Output hardware, such as a CRT display terminal, may be used for displaying 
a graphical representation of the substrate-binding site of the PPIase polypeptides 
20 described herein. Output hardware may also include a printer and disk drives. 

[0162] The CPU coordinates the use of the various input and output devices, 
coordinates data accesses from storage and accesses to and from working memory, 
and determines the sequence of data processing steps. A number of programs may be 
used to process the machine-readable data. Such programs are discussed herein in 
25 reference to the computational methods of drug discovery. 

[0163] In a preferred embodiment of the invention, X-ray coordinate data capable of 
being processed into a three-dimensional graphical display of a molecule or molecular 
complex that comprises a PPIase or PPIase-like substrate-binding pocket are stored in 
a machine-readable storage medium. The three-dimensional structure of a molecule 
30 or molecular complex comprising a PPIase or PPIase-like substrate-binding pocket is 
useful for a variety of purposes in drug discovery and drug design. 
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[0164] For example, the three-dimensional structure derived from the structure 
coordinate data may be computationally evaluated (computer-aided drug design) for 
its ability to associate with chemical entities (Butt et al., Scientific American Dec.:92- 
98 (1993); West et al., TIPS 16:67-74 (1995); Dunbrack et al., Folding & Designl 

5 2:27-42 (1997)). The term "chemical entity," as used herein, refers to a chemical 
compound, a complex of at least two chemical compounds, or a fragment of such a 
compound or complex. Such entities are potential drug candidates and can be 
evaluated for their ability to inhibit or modulate the activity of PIN 1. The ability of 
an entity to bind to, or associate with a PINl PPIase or PPIase-like substrate-binding 

10 domain, depends on the features of the entity alone. Assays to determine if a 
compound binds to PINl are known in the art, such as those exemplified herein. 

[0165] The design of compounds that bind to a PINl PPIase or PPIase-like substrate- 
binding domain may involve consideration of two factors. First, the entity must be 
capable of physically and structurally associating with some or the entire PINl PPIase 

15 or PPIase-like substrate-binding domain. The term "associating with" refers to a 

condition of proximity between a chemical entity and a binding pocket or binding site 
on a protein. The association may be non-covalent, for example, wherein the 
juxtaposition is energetically favored by hydrogen bonding of van der Waals or 
electrostatic interactions, or it may be covalent. Non-covalent molecular interactions 

20 contributing to this association include hydrogen bonding, van der Waals interactions, 
hydrophobic interactions, and electrostatic interactions. 

[0166] Second, the entity must be able to assume a conformation that allows it to 
associate with the PENl PPIase or PPIase-like substrate-binding domain directly. 
Although certain portions of the entity will not directly participate in these 

25 associations, those portions of the entity may still influence the overall conformation 
of the molecule. This, in turn, may have a significant impact on potency. Such 
conformational requirements include the overall three-dimensional structure and 
orientation of the chemical entity in relation to all or a portion of the binding pocket, 
and the spacing between functional groups of an entity comprising several chemical 

30 entities that directly interact with the PINl PPIase or PPIase-like binding pocket. 

[0167] The potential inhibitory or binding effect of a chemical entity on a PINl 
PPIase or PPIase-like substrate-binding domain may be analyzed prior to its actual 
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synthesis and testing through the use of computer-modeling techniques. If from the 
theoretical structure of the given entity it can be surmised that there is insufficient 
interaction and association between it and the PINl PPIase or PPIase-like-binding 
pocket, further testing of the entity may not be prudent. However, if computer 
5 modeling uidicates a strong interaction, the molecule can be synthesized and tested 
for its ability to bind to a PINl PPIase or PPIase-like binding pocket. This may be 
achieved by testing the ability of the molecule to modulate PINl PPIase activity using 
the assays described in herein. Using this scheme, the fruitless synthesis of 
compounds with poor binding activities may be avoided. 

10 [0168] A potential inhibitor of a PINl PPIase or PPIase-like substrate-binding domain 
may be computationally evaluated (computer-aided drug design) by means of a series 
of steps in which chemical entities are screened and selected for their ability to 
associate with the PINl PPIase or PPIase-like binding pockets. One skilled in the art 
may use one of several methods to screen chemical entities or fragments for their 

15 ability to associate with a PINl PPIase or PPIase-like substrate-binding domain. For 
example, the artesian may visually inspect a PINl PPIase or PPIase-like substrate- 
binding pocket on a computer screen based on the PIN l PPIase structure coordinates 
reported in Table HI or other coordinates that define a similar shape generated from 
the machine-readable storage medium. Selected chemical entities may then be 

20 positioned in a variety of orientations, or docked, within that binding pocket as 
described herein. Docking may be accomplished using software such as Quanta 
(Accelrys, Inc., San Diego, CA) and SYBYL (Tripos, Inc., St. Louis, MO), followed 
by energy minimization and molecular dynamics with standard molecular mechanics 
force fields, such as CHARMM (Department of Chemistry & Chemical Biology, 

25 Harvard Univ., Cambridge, MA) and AMBER (School of Pharmacy, Department of 
Pharmaceutical Chemistry, University of California at San Francisco, CA) 

[0169] Specialized computer programs to assist in the process of selecting chemical 
entities include those described in the following references, which are incorporated by 
reference herein: 

30 1 . GRID (Goodford, "A Computational Procedure for Determining Energetically 
Favorable Binding Sites on Biologically Important Macromolecules," J. Med Chem. 
28:849-857 (1985)). GRID is available from the Oxford University, Oxford, UK. 
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2. MCSS (Miranker et al., "Functionality Maps of Binding Sites: A Multiple Copy 
Simultaneous Search Method," Proteins: Struct. Funct, and Genet. 1 1:29-34 (1991)). 
MCSS is available from Accelrys, Inc., San Diego, CA. 

3. AUTODOCK (Goodsell et al., "Automated Docking of Substrates to Proteins by 
5 Simulated Annealing", Proteins: Struct. Funct. and Genet, 8:195-20 (1 990)). 

AUTODOCK is available from the Scripps Research Institute, La JoUa, Calif. 

4. DOCK (Kuntz et al., "A Geometric Approach to Macromolecule-Ligand 
Interactions," J. Mol Biol, 161:269-288 (1982)). DOCK is available from the 
University of Califomia, San Francisco, CA. 

10 5. GOLD (Jones et al., "Development and Validation of a Genetic Algorithm for 
Flexible Docking," 1 Mol Biol 267:727-748 (1997)). GOLD is available from the 
Cambridge Crystallographic Data Centre, UK. 

6. GLIDE (Eldridge et al., "Empirical Scoring Functions: 1. The Development of a 
Fast Empirical Scoring Function to Estimate the Binding Affinity of Ligands in 
15 Receptor Complexes," 1 Comput. Aided MoL Des, 1 1 :425-445 (1997)). Glide is 
available from SchrSdinger, Portland OR. 

[0170] Once suitable chemical entities have been selected, they can be assembled into 
a single compound or complex. Assembly may be preceded by visual inspection of 
the relationship of the fragments to each other on the three-dimensional image 
20 displayed on a computer screen in relation to the structure coordinates of PIN 1 PPIase 
or a PINl PPIase-ligand complex. This can be followed by manual model building 
using software such as Quanta or SYBYL. Usefiil programs to aid one of skill in the 
£ut in connecting the individual chemical entities also include those described in the 
following references, which are incorporated by reference herein: 

25 1. CAVEAT (Bartlett et al., "CAVEAT: A Program to Facilitate the Structure- 
Derived Design of Biologically Active Molecules", Molecular Recognition in 
Chemical and Biological Problems'', Special Pub., Royal Chem. Soc, 78, pp. 182-196 
(1989); Lauri et al., "CAVEAT: a Program to Facilitate the Design of Organic 
MokculQs",! Comput. Aided MoL Des. 8:51-66(1994)). CAVEAT is available 

30 from the University of Califomia, Berkeley, CA. 
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2. ISIS: See Martin, "3D Database Searching in Drug Design," 7. Med. Chem. 
35:2145-2154 (1992)). ISIS is available from MDL Information Systems, San 
Leandro, CA. 

3. HOOK (Eisen et al., "HOOK: A Program for Finding Novel Molecular 

5 Architectures that Satisfy the Chemical and Steric Requirements of a Macromolecule 
Binding Site," Proteins: Struct, Funct,, Genet,, 19:199-221 (1994)). HOOK is 
available from Accelrys, Inc., San Diego, CA. 

[0171] Instead of proceeding to build an inhibitor of a PINl PPIase or PPIase-like 
substrate-binding pocket in a step-wise fashion one chemical entity at a time as 

10 described above, inhibitory or other PINl PPIase-binding compounds may be 
designed as a whole or de novo using either an empty binding site or optionally 
including some portion(s) of a known inhibitor(s). There are many known de novo 
ligand design methods, such as LeapFrog (available from Tripos Associates, St. 
Louis, MO.) and those discussed in the following references, which are incorporated 

15 by reference herein. 

1 . LUDI (Bohm, "The Computer Program LUDI: A New Method for the De novo 
Design of Enzyme Inhibitors," J, Comp. Aid. Molec. Design. 6:61-78 (1992)); LUDI" 
is available from Accelrys Inc., San Diego, CA. 

2. SPROUT (Gillet et al., "SPROUT: A Program for Structure Generation," J, 
20 Comput. Aided MoL Design. 7:127-153 (1993)). SPROUT is available from the 

University of Leeds, UK. 

[0172] Other molecular modeling techniques may also be employed (see, e.g., Cohen 
et al., J. Med. Chem. 33:883-894 (1990); Navia et al., Curr. Opin. Struct. Biol T.l^l- 
210 (1992); Balbes et al., Reviews in Computational Chemistry, Vol. 5, K. Lipkowitz 
25 et al., eds., VCH, New York, pp. 337-380 (1994); Guida, Curr. Opin. Struct Biol 
4:777-781 (1994)). 

[0173] Once a chemical entity has been designed or selected by using such methods, 
the efficiency with which that entity may bind to a PINl PPIase substrate-binding 
pocket may be tested and optimized by computational evaluation. For example, an 
30 effective PINl PPIase substrate-binding-pocket inhibitor preferably demonstrates a 
relatively small difference in energy between its bound and fi^e states (i.e., a small 
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deformation energy of binding), PINl PPIase substrate-binding pocket inhibitors may 
interact with the substrate-binding domain in more than one conformation that is 
similar in overall binding energy. In those cases, the deformation energy of binding is 
taken to be the difference between the energy of the free entity and the average energy 
5 of the conformations observed when the inhibitor binds to the protein. 

[0174] An entity designed or selected as binding to a PINl PPIase substrate-binding 
domain may be further computationally optimized so that in its bound state it would 
preferably lack repulsive electrostatic interaction with the target enzyme and with the 
surrounding water molecules. Such non-complementary electrostatic interactions 
10 include repulsive charge-charge, dipole-dipole and charge-dipole interactions. 

[0175] Suitable computer software is available to evaluate compound deformation 
energy and electrostatic interactions. Examples of programs designed for such uses 
include: Gaussian (Frisch, Gaussian, Inc., Carnegie, PA); AMBER (Kollman, 
University of California at San Francisco); Jaguar (Schrodinger, Portland, OR); 

15 SPARTAN (Wavefunction, Inc., Irvine, CA); QUANTA/CHARMM (Accelrys, Inc., 
San Diego, CA.); Impact (Schrodinger, Portland, OR ); Insight Il/Discover (Accelrys, 
Inc., San Diego, CA); MacroModel (Schrodinger, Portland, OR ); Maestro 
(Schrodinger, Portland, OR ); DelPhi (Accelrys, Inc., San Diego, CA); and AMSOL 
(Quantum Chemistry Program Exchange, Indiana University). These programs may 

20 be implemented, for instance, using workstations produced by companies, such as 
Silicone Graphics, Hewlet Packard, Sun Microsystems, and International Business 
Machines. 

[0176] In another approach small-molecule databases are computationally screened to 
determine their potential to bind in whole, or in part, to a PINl PPIase or PPIase-like 
25 substrate-binding pocket. In this screening, the quality of fit of such entities to the 
binding site may be judged either by shape complementarity or by estimated 
interaction energy (Meng et al. J, Comp. Chem, 13:505-524 (1992)). Binding of 
potential modulators can be assessed biochemically, for example, using isothermal 
titration calorimetry as described herein. 

30 [0177] The structure coordinates set forth in Table III can be used to obtain structural 
information about another crystallized molecule or molecular complex. This may be 
achieved by any suitable known technique, such as molecular replacement. By using 
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molecular replacement, all or part of the structure coordinates of the mutant PINl 
PPIase polypeptide:Compound I complex can be used to determine the structure of a 
crystallized molecule or molecular complex whose structure is unknown. This 
process is more efficient than attempting to determine such information ab initio. 

5 [0178] Molecular replacement provides an accurate estimation of the phases for an 
unknown structure. Phases constitute a factor in equations used to solve crystal 
structures that cannot be determined directly. Obtaining accurate values for the 
phases, by methods other than molecular replacement, is a time-consuming process 
that involves iterative cycles of approximations and refinements and greatly hinders 
10 the solution of crystal structures. However, when the crystal structure of a protein 
containing at least a homologous portion has been solved, the phases from the known 
structure can provide a an estimate of the phases for the unknown structure. 

[0179] The method involves generating a preliminary model of a molecule or 
molecular complex whose structure coordinates are unknown, by orienting and 

15 positioning the relevant portion of the mutant PINl PPIaserCompound I complex 
according to Table III within the unit cell of the crystal of the unknown molecule or 
molecular complex so as best to theoretically account for the observed X-ray 
diffraction data of the crystal of the molecule or molecular complex whose structure is 
unknown. Phases can then be calculated from this model and combined with the 

20 observed X-ray diffraction data amplitudes to generate an electron density map of the 
structure whose coordinates are unknown. This, in turn, can be subjected to any 
known model building and structure refinement techniques to provide a final, accurate 
structure of the unknown crystallized molecule or molecular complex (Lattman, Meth, 
EnzymoL 1 15:55-77 (1985); Rossmann, ed., "The Molecular Replacement Method," 

25 InL Sci, Rev, Ser., No. 13, Gordon & Breach, New York (1 972)). Thus, the structure 
of any portion of any crystallized molecule or molecular complex that is sufficiently 
homologous to any portion of the mutant PINl PPIase:Compound I complex can be 
resolved by this method. 

[0180] In another preferred embodiment, the method of molecular replacement is 
30 utilized to obtain structural information about another PPIase. The structure 

coordinates of PINl PPIase as described herein are usefiil in solving the structure of 
other isoforms of PINl or other PINl containing complexes. 
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[0181] Furthermore, the structure coordinates of the PBSfl PPIase polypeptides, 
described herein, are useful in solving the structure of other PINl proteins that have 
amino acid substitutions, additions and/or deletions. These PINl mutants may 
optionally be ciystallized in complex with a chemical entity, such as Compound 1. 
5 The crystal structure of such a complex may then be solved by molecular replacement 
and compared with structure of the PINl PPIase polypeptides described. Potential 
sites for modification within the various binding sites of the enzyme may thus be 
identified. This information provides an additional tool for determining the efficient 
binding interactions, for example, increased hydrophobic interactions, between PINl 
10 PPIase and a chemical entity. 

[0182] The structure coordinates are also useful to solve the structure of crystals of 
PINl or PINl homologues complexed with chemical entities. This approach enables 
the determination of the important sites for interaction between chemical entities, 
including potential PINl modulators with the PINl substrate-binding site. For 
15 example, high resolution X-ray diffraction data collected fi-om crystals exposed to 
different types of solvent allows the determination of where each type of solvent 
molecule resides. Small molecules that bind tightly to those sites can then be 
designed and synthesized and tested for their ability to modulate PINl PPIase activity. 

[0183] All of the complexes referred to above may be studied using known X-ray 
20 diffraction techniques and may be refined versus 1 .5-3.0 A resolution X-ray data to an 
R value of about 0.20 or less using computer software, such as X-PLOR (Brunger, 
1992, supra^ distributed by Accelrys, Inc., San Diego, CA, This information may be 
used to optimize known PINl PPIase modulators, and to design new PINl PPIase 
modulators. 

25 E. Peptidyl-Prolyl Isomerase Assay 

[0184] PINl is a phosphorylation dependent peptidyl-prolyl isomerase. Peptidyl- 
prolyl ismomerase activity for the peptides of the invention can be measured using a 
spectrophotometric assay based on the coupled chymotrypsin catalyzed, cis-trans 
conformation dependent cleavage of a para-nitroanaline-containing peptide substrate. 
30 This rotamase assay is described by Kofron et ah {Biochemistry 30, 6217-6134 
(1991)) and its application to PINl isomerase activity is described by Yaffe et al. 
{Science 278, 1957-1960 (1997)). Cleavage of the isomerized peptide releases para- 
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nitroanaline, which can be monitored by an increase in absorbance at 390 nm. The 
PINl peptide substrate, succinyl-alanine-leucine-proline-phenylalanine- 
paranitroaniline (Suc-AEPF-pNA) (Bachem California, Inc., Torrence, CA) is kept in 
a predominantly cis conformation with an anhydrous TFE (trifluorethanol)/LiCl 

5 (lithium chloride) solvent mixture. Upon dilution into an aqueous assay mixture 
containing peptides with PINl PPIase activity, the peptide substrate undergoes PINl 
catalyzed isomerization to the trans conformation. Chymotrypsin or other suitable 
protease, such as Subtilisin Carlsberg cleaves the trans product to form free para- 
nitroanaline. To minimize the spontaneous isomerization of the peptide substrate, 

10 reactions are performed at 1 5 °C. Using this method, both the wild-type PINl and 
mutant PINl PPIase (K77Q/K82Q) (SEQ ID N0:4) at a concentration of 0.033 nM 
with 100 )iM Suc-AEPF-pNA had a rate of 0.2. The Ki of Compound I (Example 1) 
and K77Q/K82Q was 0.06 ^iM. 

[0185] The following examples are for the purpose of illustrating various 
1 5 embodiments and features of the invention 

Examples 

Example 1 Synthesis of a PINl inhibitor-compound I 

[0186] Compound I was synthesized according to scheme I. The abbreviations 
employed in Scheme I have the following meaning unless otherwise indicated: 
20 CBzCl=Benzyl chloroformate; MCPBA=3-chloroperoxybenzoic acid; Pdipalladium; 
ETOH=ethyl alcohol; EtOAc=ethyl acetate; Ph=phenyl; and Bn=benzyl. 

[0187] Synthesis of Compound I- Scheme 1: 
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4 (Compound I) 



Example lA 



)-OBn 




5 [0188] Alcohol 1: To a methylene chloride solution (80 mL) of D'-phenylalanihol 
(1.15 g, 7.61 mmol) was added triethylamine (1.59 mL, 11.4 mmpl) and benzyl 
chloroformatep.l9 mt, 8.37 mmoljf The mixture was stiired foi' 3 hours (h) and j 
then concentrated. The residue was dissolved in methylene chloride (50 mL) and 
washed with brine (1x50 mL). The solution was dried (Na2S04) and concentrated. 

10 After column chromatography purification (10 to 30% EtOAc in hexanes), the title 
compound was obtained in 73% yield (1 .59 g). 

NMR (CDCI3): 5 7.46-7.15 (lOH, m), 5.11 (2H, s), 4.96 (IH, m), 3.98 (IH, m), 
3.72 (IH, m), 3.63 (IH, m), 2.89 (IH, d, J=7.2 Hz). 

MS (ESP): 286 (M+tT); 284 (M-H) . 

15 Example IB 
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[0189] Phosphate Benzyl Ester 2: To an acetonitrile solution (40 mL) of the alcohol 
1 (1.58 g, 5.54 mmol) and IH-tetrazole (1.05 g, 15 mmol) was added dibenzyl N,N- 
diisopropylphosphoramidite (3.72 mL, 11.1 mmol) at 25 After 3h, MCPBA (4.19 
5 g, 70% pure, 13.85 mmol) was added to the suspension. The solution was diluted 
with EtOAc (100 mL), washed with concentrated NaHSOa solution (2x80 mL), dried 
over MgS04 and concentrated in vacuo. The residue was purified by column 
chromatography (10-30% EtOAc in hexanes) to give 2.88 g of the title compound in 
95% yield. 

10 NMR (CDCI3): 8 7.47-7.05 (20H, m), 5.19-4.96 (7H, m), 4.09-3.83 (3H, m), 2.93- 
2.67 (2H, m). 

MS (positive ESP): 568 (M+Na"); MS (negative ESP): 580 (M+Cl)". 
Example IC 



To an ethanol solution of the phosphate benzyl ester (2, 2.88 g, 5.28 mmol) was added 
palladium on carbon (10%, 300 mg). The suspension was kept under hydrogen 
atmosphere (latm) for 4 h, and was then filtered through a pad of Celite. The 
collected solid was washed with methylene chloride. The mixture of the solid and 
20 Celite was suspended in 5% HCl solution and stirred for 20 min. After filtration, the 
filtrate was concentrated to dryness, affording 1 .2 g of the title compound in 86% 
yield, 

*H NMR (CD3OD): 5 7.49-7.25 (5H, m), 4.22-4.08 (IH, m), 4.0 (IH, m), 3.72 (IH, 
m), 3.03 (2H, d, J=7.5 Hz). 

25 LCMS: 232 (M+H^; 230 (M-H)". 

HRMS (MALDI) calc for C9H15NO4P (M+lf ) 232.0733; found 232.0736. 




NH2 •HCl 



3 



15 [0190](2R)-2-amino-3-phenylp*ropyl-dihydrogen-phosphate-hydrochloride (3): 
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Example ID 




[0191] Compound I- Phosphoric acid mono-{(R)-2-[(l-benzo[b]thiophen-2-yl- 
methanoyl)-amino]-3-phenyl-propyI} ester (4): To a sodium carbonate solution 
5 (IM, 1 mL) was added the aminophosphate 3 (48 mg, 0.179 mmol) and 

benzothiophene-2-carbonyl chloride (35 mg, 0.179 mmol). After 15h, it was acidified 
to pH~l by addition of concentrated HCl solution at 0 ^C. Preparative HPLC 
purification gave 34 mg (48% yield) of the title compound. 

NMR (CD3OD): 5 7.96 (IH, s), 7.90 (2H, m), 7.43 (2H, m), 7.37-7.17 (5H, m), 
10 4.50 (IH, m), 4.10 (2^ m), 3.09 (IH, dd, J=13.9, 6.6 Hz), 3.00 (IH, dd, J=13.9, 7.8 
Hz). 

HRMS (MALDI) calc for CigHigNOsPSNa (M+Na"^) 414.0540; found 414.0536. 

Example 2 Cloning and biochemical analysis of PINl PPIase polypeptides 

[0192] The PPIase domain from wild-type PINl was amplified by PCR (MuUis et al., 
15 CSHSymp, Quantum Biol 51:263-273 (1986); Saiki et al., Science 239:487-491 
(1988)), using a pET3a vector (Novagen, Madison, WI) containing the coding 
sequence for full-length PINl. The primers used were as follows: 

Forward primer-5' AGCAGCCATATGGGCAAAAACGGGCAGGGGGAGCCT-3' 
(SEQIDN0:5) 

20 Reverse primer-5'-CTTGGATCCTCACTCAGTGCGGAGGATGAT-3' (SEQ ID 
N0:6) 

[0193] The amplified DNA was cloned into the Nde\ and BamHl sites of the bacterial 
expression vectors pET3a and pET28a (Novagen), and sequence verified. pET28a 
contains a 6 Histidine tag followed by a thrombin cleavage site. 
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[0194] The amino acid sequence of the PINl PPIase domain corresponds to amino 
acids 45-163 of full-length PINl (GenBank Accession No. XM_009024) and is 
shown below: 

45 GKNGQG EPARVRCSHL LVKHSQSRRP SSWRQEKITR TKEEALELIN 
5 GYIQKIKSGE EDFESLASQF SDCSSAKARG DLGAFSRGQM QKPFEDASFA 
LRTGEMSGPV FTDSGIHIIL RTE 163 (SEQ ID NO:7) 

[0195] The pET3a vector coded for a recombinant PINl PPIase polypeptide, which 
contained an additional M residue at the N-terminus. The pET28a vector expressed a 
recombinant PINl PPIase polypeptide, which upon thrombin cleavage, generated a 
10 polypeptide with four additional amino acids at the N-terminiis corresponding to the 
following amino acid sequence: 5'- GSHM-3'. 

Example 3 PINl PPIase K77Q/K82Q 

[0196] The double mutant, K77Q/K82Q, which contains the amino acid lysine instead 
of the amino acid glutamine at positions 77 and 88, was generated by the 
15 QuickChange^J^ site-directed mutagenesis method (Stratagene, La JoUa, CA) 

following the manufacturer's protocol and as described below (Catalog # 2005 1 8; 
revision # 108005h), using the pET28a PPIase vector and the following PGR primers: 

PIN1K77/82Q Forward: 

5'-GCGGCAGGAGCAGATCACCCGGACCCAGGAGGAGGCCCTGGAGC-3* 
20 (SEQIDN0:8) 

PIN1K77/82Q Reverse: 

5'-GCTCCAGGGCCTCCTCCTGGGTCCGGGTGATCTGCTCCTGCCGC-3'(SEQ 
IDN0:9) 

Mutagenesis Protocol: 

25 [0197] A sample reaction mixture was prepared by combining 5 jiil of lOx reaction 
buffer (100 mM KCl, 100 mM (NH4)2S04, 200 mM Tris-HCl (pH 8.8), 20 mM 
MgS04, 1% Triton® X-100, and 1 mg/ml nuclease-free bovine serum albumin 
(BSA)); 5-50 ng of dsDNA template; 125 ng of each primer; 1 fxl of dNTP mix; 
ddH20 to a final volume of 50 ^l. 
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[0198] To the sample reaction mixture was added 1 jxl of PfuTurbo® DNA 
polymerase (2.5 U/|xl). The reactions were overlayed with 30 jxl of mineral oil. Each 
reaction was cycled using the following cycling parameters: Segment 1- one cycle at 
95 for 30 seconds; Segment 2- 12 to 18 cycles at 95 °C for 30 seconds, 55 °C for 

5 one minute and 68 for 2 minutes/kb of plasmid length. After cycling, 1 |il of Dpnl 
restriction enzyme (10 U/|il) was added below the mineral oil overlay. The reaction 
mixtures were gently and thoroughly mixed and spun down in a microcentrifuge for 1 
minute. After centrifugation, the reactions were incubated at 37 °C for 1 hour to 
digest the parental supercoiled dsDNA. One ^il of the Z)/?«l -treated DNA fi-om each 

10 control and sample reaction were used to transform E, coli strain DH5a. 

[0199] The K77Q/K82Q PINl PPIase mutant was sequence verified. 

[0200] The amino acid sequence of the K77Q/K82Q PINl PPIase mutant is shown in 
Fig. 5. The amino acid sequence of the PPIase domain of the K77Q, K82Q PINl 
mutant is shown below. 

15 45GKNGQGEPARVRCSHLLVKHSQSRRPSSWRQEQITRTQEEALELIN 
GYIQKIKSGE EDFESLASQF SDCSSAKARG DLGAFSRGQM QKPFEDASFA 
LRTGEMSGPV FTDSGIHIIL RTE (SEQ ID NO: 10). 

Example 4 Purification and biochemical analysis of PINl PPIase polypeptides 
A. Fementation 

20 [0201] coli BL21(DE3) cells containing a PET28a vector encoding for either wild- 
type PINl PPIase or mutant PPIase K77Q/K82Q were inoculated into 5 ml of 2xYT 
media (per liter: 16 g tryptone, 10 g yeast extract, 5 g NaCl) containing 50 |ug/ml 
Kanamycin in a Falcon 2059 tube. This culture was shaken overnight at 250 rpm at 
37 "^C, The overnight culture was diluted 100-fold in 2xYT medium containing 50 

25 jig/ml kanamycin. The diluted culture was shaken at 250 rpm at 37 °C to an OD595 of 
from 0.6 to 0.8. 0.3 mM IPTG was added and the culture shaken overnight at 250 
rpm at 25 °C. The overnight cell culture was centrifuged at 5000 rpm for 20 min. 
The pellets were resuspended in lOx buffer A (50 mM Na3P04, pH 7.5, 0.5 M NaCl, 
20 mM imidazole, 5 mM 2-mercaptoethanol). The suspension v^as passed through a 

30 high-pressure microfluidizer. The homogenate was centrifuged down in a Beckman 
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ultracentrifuge at 40,000 rpm at 4 °C for 45 min. The clear supernatant was saved for 
further purification. 

B. Purification 

[0202] The clarified supernatant was loaded onto a Ni-NTA column (20 ml) at 4 
5 ml/min. The column was washed with 200 ml of buffer A. A linear gradient (400 ml) 
was run at 4 ml/min from 100% buffer A to 100% buffer B (50 mM Na3P04, pH 7.5, 
0.5 M NaCl, 500 mM imidazole, 5 mM 2-mercaptoethanol). The fractions were 

collected (6 ml) and sep^ated using SDS-PAGE (12%). The fractions containing 

i 

6xHis PINl PPIase were collected and pooled. The pooled fractions were dialyzed 

I 

10 against 4 liters of buffer C (25 mM HEPES pH 7.5, 100 mMNaCl, 5 mM 2- 
mercaptoethanol) overnight at 4°C. 

C. Thrombin cleavage 

[0203] To the pooled fractions containing 6xHis PINl PPIase was added biotinylated 
thrombin (1 unit per 10 mg protein). The solution was gently rotated overnight at 
15 4°C. The overnight solution was passed through a Ni-NTA column (5 ml) and a 
Streptavidin- Agarose column (1 ml). The flowthrough was collected and 
concentrated to about 10 mg/ml for further studies. 

D. PINl Peptidyl-Prolyl Isomerase Assay 

[0204] Peptidyl-prolyl isomerase reactions were carried out in 25 mM MOPS [3-(N- 
20 Morpholino)propanesufonic acid], pH 7.5, 0.5 mM TCEP [Tris(2- 

carboxyethyl)phosphine hydrochloride], 2% DMSO, 5 |li1 of a 25 mg/ml solution of 
Subtilisin Carlsberg Protease (Sigma), 50 nM PINl-PPIase, and 100 ^M Suc-AEPF- 
pNA peptide substrate. Reactions were cooled to 15 and initiated with the addition 
of Suc-AEPF-pNA. The absorbance at 390 nm was monitored continuously until all 
25 substrate had been converted to the cleaved product. This data, the progress curve, 
was then fitted to an exponential equation to determine a rate constant k for the 
reaction. The rate constant k is linearly proportional to the concentration of active 
enzyme present in the assay mixture once the rate constant for the spontaneous 
isomerization is subtracted. The Km for this substrate was much higher than 100 |liM 
30 ([S]«Km). Therefore, during the inhibition experiment, the IC50 for this non-tight- 
binding inhibitor was essentially Ki. Without an inhibitor present, both wild type 
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human PINl and mutant PINl PPIAse, at 0.033 nM with 100 uM Suc-AEPF-pNA, 
had a rate of 0.2. The Ki of Compound I and mutant PPIase K77Q/K82Q was 0.06 

E. Isothermal Titration Calorimetry 

5 [0205] The binding of Compound I to a His-tagged construct of the K77Q/K82Q 
PPIase domain ( Fig. 4) was studied by isothermal titration calorimetry (ITC) as 
follows. The titrations were performed in duplicate and the stated uncertainties are 
the standard deviations of the averaged results. , 

[0206] Following a preliminary 2 |iL injection, twenty to twenty-five 10 |liL injections 
10 of a 200 |LiM solution of the PPIase polypeptide was titrated into a 10 [iM solution of 
Compound 1. The titrations were performed using a VP-ITC (MicroCal, 
Northampton, MA) at 15.0 ^C with stirring set at 270 rpm, 4 minutes injection 
intervals, and a 20 second injection duration for the 10 pL injections. The working 
volume of the ITC cell was 1.414 mL. Both solutions contained 25 mM MOPS pH 
15 7.5, 0.5 mM TCEP and 2.0% DMSO (voL/voL). The PPIase polypeptide solution was 
prepared by exhaustively dialyzing a stock protein solution against several changes of 
dialysis buffer (25 mM MOPS pH 7.5, 0.5 mM TCEP) at 4.0 ^C. 

[0207] After dialysis the protein was centrifuged to remove any particulate matter. 
The protein concentration was then determined by absorbance using an extinction 

20 coefiRcient that had been calculated based on the tryptophan and tyrosine content of 
the protein. The dialysed protein was then diluted with the dialysate and 2.0% 
(volume to volume) DMSO was added to yield a final concentration of 200 
protein. A 20 mM Compound I stock solution was prepared by dissolving a small 
amount of the compound in DMSO. An aliquot of the stock solution was diluted in 

25 DMSO and then an appropriate volume of dialysate was added. The final DMSO 
concentration was 2.0% (volume to volume) and the final compound concentration 
was 10 ^M. 

[0208] Appropriate control titrations (buffer into buffer, buffer into compound, and 
protein into buffer) were performed to determine the heats of dilution. Prior to fitting 
30 for the binding parameters, the observed heats of binding were corrected for heat of 
dilution of the protein. The machine blank correction (buffer into buffer) and the heat 
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of dilution of the compound were comparable and as such were neglected when 
correcting for the heats of dilution. The data were fit using the ORIGIN® software 
package (MicroCal) provided with the ITC (Fig. 6). In Fig. 6, the solid line represents 
the best fit of the corrected binding data using the ORIGIN software package (ka=l .42 
5 X 10^ M'\ C value=142). The One Set of Sites model with ligand in the cell was 
selected. The lower than one to one stoichiometry that was observed is most likely 
the result of the presence of a small amount of inactive enzyme in the stock protein 
sample. This result was consistent with the observation of a slight reduction in the 
enzymatic activity of the protein sample. 

10 [0209] The stoichiometry, dissociation constant and enthalpy of binding were 

determined to be 0.854 (±0.003), 67 (±5) nM and -7.3 (±0.1) kcal/mol, respectively. 

Example 5 Crystallization of PPIase polypeptides and PPIase PPIase/Compound I 
Complex 

[0210] Crystals of the apoenzyme (thrombin cut PPIase K77Q/K82Q) were grown at 
15 1 3 ^'C via the hanging-drop vapor-diffiision method. Crystals were obtained by 
mixing equal volumes of protein solution (10-15 mg/ml protein) and reservoir 
solution of 1 .2-1 .4 M Na Citrate, with 0.1 M Hepes (or Borate, when pH >8.5) at a 
pH range of 7.5-10.0 (optimum pH = 8.8), and 5 mM DTT. Crystals typically grew 
within 3 days. For X-ray data collection, crystals were transferred into a cryo- 
20 protectant containing 20% glycerol in addition to the reservoir solution and flash 
fi-ozen in liquid nitrogen. The crystals, which were determined to belong to the 
monoclinic space group C2 with a= 1 16.84, b= 35.82, c= 51.40 A alpha= 90.0, beta= 
100.33, and gamma= 90.0 degrees, contained two molecules per asymmetric unit. 

[021 1] Crystals of thrombin cut PPIase K77Q/K82Q and Compound I were obtained 
25 by crystallization under conditions similar to those described above for the 

apoenzyme. The protein was diluted to 10 mg/ml, then exposed to Compound I 
(dissolved in 100% DMSO) by adding to a final concentration of 1 mM. The ratio of 
PPIase polypeptide to Compound I was 1 :5. The reservoir solution contained 1.4 M 
Na citrate, with 0.1 M Hepes at pH 7.5 (titrated with HCl) and 10 mM DTT. The 
30 resulting protein/Compound I solution was then incubated for 24 hours at 4 ^C, and 
filtered through a 0.45|aM cellulose-acetate membrane prior to setting up 
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crystallization experiments. Crystals grew within 3 days. The crystal :ligand 
complexes had the identical space group (C2) and similar cell dimensions as 
described above for the apoenzyme. 

Example 6 PPIase K77Q/K82Q structure solution 

5 [0212] The structure of the PPIase mutant K77Q/K82Q was solved by molecular 
replacement (MR) using EPMR software (Kissinger et ah. Acta Cryst, 055:484-491 
(1999)), with residues 55-163 of the native PINl structure as the MR probe. The R- 
factor for the correctly positioned and oriented dimer was 39.7% for data in the 10-4.0 
A range. The MR solution was refined by ARP/wARP (EMBL) to an R-factor of 

10 1 7.6% to produce a SIGMAA weighted 2Fo-Fc map for fitting. Refinement was 
carried out using simulated annealing and conjugate gradient minimization protocols 
in the program X-PLOR (Bninger, 1992, supra) (see Table I for refinement statistics). 
The final model included all atoms for residues 51-163 in molecule A (excluding the 
side chain atoms of residues 69 and 87), all atoms for residues 54-163 in molecule B 

15 (excluding the side chain atoms of residues 69, 94, and 95) plus 242 waters. The 
structure coordinates for the apoenzyme are given in Table 11. 

Example 7 PPIase K77Q/K82Q complexed with Compound I structure solution 

[0213] Protein atomic coordinates from the crystal structure of PPIase K77Q/K82Q 
were used to initiate rigid-body refinement in X-PLOR followed by simulated 

20 annealing and conjugate gradient minimization protocols. Placement of the inhibitor 
and addition of ordered solvent into diflFerence electron density maps was followed by 
subsequent rounds of refinement using X-PLOR (see Table I for refinement statistics). 
The final model included all atoms for residues 51-163 in molecule A (excluding the 
side-chain atoms of residue 87), all atoms for residues 54-163 in molecule B 

25 (excluding the side-chain atoms of residues 94 and 95) plus Compound I and 181 

waters. Inhibitor occupancy in molecule B was lower than that observed for molecule 
A. 

[0214] The results fi'om the crystallographic analysis are shown in Table I below. 
Crystal structure coordinates are set forth in Table III. 

30 

[0215] Table I. Statistics for crystallographic analysis 
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PPIase(K77/K82Q) 
PPIase(K77/K82Q)+Compound I 



Resolution (A) 1.85 2.00 

Reflections measured 501 17 65503 

5 Unique reflections 16272 14274 

Completeness (%) 89.5(53 .4) 97.9 

R^ym 4.3(12.6) 5.8(17.1) 

Rcryst^ (%) 20.9 20.3 

%yn = S|I( h)-I(h)i|/ZI(h)i*100 ( 

10 h,i=l h,i=l I 



Where I(h)i is the i^ measurement of reflection h and I( h ) is the mean value of the N 
equivalent reflections 

^Rcryst = 2 II Fobsl " | Fcalc|| / 2| Fobs| 

Where the summation is over data used in the refinement 

15 Example 8 High-throughput assay utilizing peptides that contain the PINl PPIase 
domain 

[0216] This assay is based on fluorescence polarization. In fluorescence polarization 
detection, monochromatic light passes through a polarized filter and excites molecules 
in the sample well. Only those molecules that are oriented properly in the polarized 

20 plane absorb light, become excited, and subsequently emit light. The emitted light is 
detected after passing through polarizing filters that are oriented parallel and 
perpendicular to the plane of excitation. Since small molecules rotate more quickly 
than large molecules (e.g. in the form of a bound complex), the parallel (S) and 
perpendicular (P) measurements are closer and the difference is lower. Fluorescence 

25 polarization is measured in mP (milliP) which is defined using the following equation: 
mP=1000*(S-P)/(S+P) 

[0217] For the PINl assay, library compounds compete with fluorescein-tagged 
Pintide to bind the PPIase domain of PINl. After a short incubation, samples are 
assayed using fluorescence polarization. The excitation and emission of fluorescein 
30 occur at 485 nm and 530 nm, repectively. The assay is homogeneous and performed 
with or without the presence of library compounds. Formation of a complex between 
fluorescein-tagged Pintide and the PPIase domain of PINl leads to large differences 
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between the S and P measurements, resulting in high mP values. Compounds that 
bind to the PPIase domain of PIN 1 and prevent the formation of this complex lower 
the mP values. 

Materials and Reagents 

5 Experiments were performed in either 96-well plates or 384-well black flat bottom 
polystyrene non-binding surface (NBS) plates (Costar). The PPIase substrate was a 
fluorescein-tagged Pintide, FL-WFYpSPFLE (SEQ ID NO: 11) where pS equals 
phosphorylated serine. The inhibitor control was Pintide without the fluorescein tag. 
Fluorescent Pintide was either purchased (AnaSpec, Inc., San Jose, CA) or 

10 synthesized as described herein. The buffer conditions were .25 mM MOPS [3-(N- 
Morpholino)propanesufonic acid], and 0.5 mM TCEP [Tris(2-carboxyethyl)phosphine 
hydrochloride], at pH 7.5. For inhibitor controls, free Pintide was used at 50, 10, and 
2 |jM (IC50 of free Pintide is about 7-10 |liM). Excitation was measured at 485 nm 
and emission was measured at 530 nm. Readings were taken in a Florescence 

15 Polarization reader (Molecular Devices Analyst). 

Pintide Synthesis 

[021 8] Pintide (WFYpSPFLE) was synthesized on an Applied Biosystems 433 A 
Peptide Synthesizer on a O.lmmol scale using standard Fmoc chemistry and preloaded 
HMP resin. After thorough washing with dichloromethane (DCM) (Fisher), the 

20 peptide was cleaved from the resin and deprotected in trifluoroacetic acid (TFA) 

(Aldrich) with ethanedithiol and thioanisole present as scavengers. The solution was 
filtered into cold m-tert butyl ether (MTBE) (Aldrich) to precipitate the peptide and 
centrifuged at 6 Krpm for 3 minutes. The resulting pellet was washed and centrifuged 
in cold MTBE four times then dried under vacuum. The dried precipitate was 

25 resuspended and lyophilized overnight. 

[0219] Purification was performed on an ISCO 2350 HPLC with a Linear LS500 
scanning detector and a Foxy II fraction collector. The purification conditions were 
as follows: mobile phase was 0.1%TFA:H2O and eluent was 0.1%TFA:CH3CN 
(acetonitrile (Omnisolve, VWR)); the gradient was 5% to 95% in 30 minutes on a 25 
30 X 1cm Hypersil ODS (5 (im, 300A, Phenomenex); the flow rate was 2.5 ml/min; and 
fractions at 30-second intervals. 
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[0220] Fractions were analyzed on an HP 1050 with the same buffer system and 
gradient on a 100 x 4.6mm Hypersil ODS column (Hewlett-Packard). Pure product 
(Pintide; elution time = 12.22 minutes) was lyophilized overnight. Compound 
identity was confirmed by MALDI-TOF mass spectroscopy. 

5 [0221] Fluorescein modification was carried out following the basic protocol 
published by Molecular Probes (MP-00143 ; 8/19/98) as described below. 

[0222] Twenty mg of purified, lyophilized Pintide (peptide content -60% so, 12mg 
actual peptide) was resuspended in 1 .75 ml of 0. IM NaHCOs (sodium bicarbonate) 
(Sigma), pH 8.3. 3.3 mg fluorescein-5-EX succinimidyl ester (Molecular Probes #F- 

10 6130) was resuspended in DMSO at 10 mg/ml (330 jxl). 165 |xl of this solution was 
added dropwise to the peptide solution under continuous stirring at room temperature. 
After 30 minutes, the remaining 165 ^1 was added dropwise under continuous stirring. 
After 60 minutes, the solution was loaded on the HPLC (under conditions described 
previously) to stop the reaction and facilitate purification. Fractions were analyzed as 

15 previously described and the product (fluroescein-tagged Pintide; elution time = 14.58 
minutes) was lyophilized overnight. Compound identity was confirmed by MALDI- 
TOF mass spectroscopy. 

Assay Plate Format and Screening Conditions for 96-well plates: 

[0223] Forty-five of assay buffer containing 20 |aM fluorescein-Pintide was 
20 dispensed into each of the wells. Test compounds (1 |iL of a 0.5 mM stock 

concentration in DMSO) were added to columns 1-22. The 6His-PPIase domain of 
PINl (5 ^iL of a 4 \jM solution in assay buffer) was added to all wells in columns 1-22 
and most wells in columns 23-24. The following controls were used in columns 23- 
24: wells A23-F24 were DMSO controls and were used to calculate the maximum 
25 value; wells G23-H24, 123-J24, and K23-L24 were inhibitor controls at 50 ^iM, 10 
pM, and 2 nM free Pintide, respectively; and wells M23-P24 contained no PPIase and 
were used to calculate the minimum value. The assay was incubated at room 
temperature for 10 minutes and immediately read at excitation 485 nm and emission 
530 nm in fluorescence polarization mode. The percent inhibition of each well was 
30 calculated using the following equation: 

% inhibition = 100* (1- (mPwell -Minaverage)/(MaXaverage - Minaverage)) 
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[0224] The order of addition can be changed. For example, in a variation of the 
present assay, compounds can be added to the plate first, followed by fluorescein- 
Pintide in asssay buffer, and finally 6His-PPIase. As currently designed, the assay is a 
competition assay. 

5 [0225] The premise of the assay is different when the fluorescein-Pintide and 6His- 
PPIase are added first followed by compound addition. The fluorescein-Pintide and 
6His-PPIase preform a complex. When the compound is added, it must displace the 
fluorescein-Pintide from the binding site. This may occur depending on the Kd of the 
compound; however, a longer incubation is required. 

10 [0226] The foregoing description has been provided to illustrate the invention and its 
preferred embodiments. The invention is intended not to be limited by the foregoing 
description, but to be defined by the appended claims. 



